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SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 

5 This application is a continuation-in-part of and claims 
the benefit of U.S. Provisional Application No. 60/267,231, 
filed February 7, 2001, U.S. Provisional Application No. 
60/205,839, filed May 19, 2000 and U.S. Provisional 
. Application No. 60/185,667, filed February 29, 2000, the 
10 contents of which are hereby incorporated by reference into 
this application. 

The invention disclosed herein was made with Government 
support under NIH Grant Nos . R01A143847 (T.D.) and 
15 R01DK54718 (T.P.S.) from the Department of Health and Human 
Services. Accordingly, the government has certain rights in 
this invention. 

Throughout this application, • various publications are 
20 referenced within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citations for these references may be found 
25 immediately preceding the claims. 

Background of the Invention 

HIV-1 entry into target cells is mediated by the successive 
interaction of the envelope glycoprotein gpl20 with CD4 and 
30 a co-receptor belonging to the seven trans -membrane G 
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protein-coupled chemokine receptor family (Berger et al . 
Ann. Rev. Immunol. 17:657, 1999). Binding of gp!20 to CD4 
exposes or creates a co-receptor binding site on gpl2 0 
(Trkola et al . Nature 384:184, 1996, Wu et al . Nature, 
5 384:179, 1996). CCR5 and CXCR4 are the most physiologically 
relevant and widely used HIV-1 co-receptors (Zhang and 
Moore, J. Virol. 73:3443, 1999). CCR5 mediates the entry of 
R5 isolates and CXCR4 mediates the entry of X4 isolates. 
R5X4 isolates are able to exploit both co-receptors (Berger 
10 et al. Ann. Rev. Immunol. 17:657, 1999). It has been 
demonstrated that specific amino acids including acidic 
residues and tyrosines located within the CCR5 amino- 
terminal domain (Nt, amino acids 2-31) are essential for 
CCR5-mediated fusion and entry of R5 and R5X4 HIV-1 strains 
(Dragic et al . J. Virol. 72:279, 1998; Rabut et al . J . 
Virol. 72:3464, 1998; Farzan et al . J. Virol. 72:1160, 
1998; Dorantz et al . J - Virol. 71:6305, 1997). More 
recently, Farzan et al . demonstrated that tyrosine residues 
in the CCR5 Nt are sulfated (Farzan et al . Cell 96:667, 
1999) 



15 



Inhibition of cellular sulfation pathways, including 
tyrosine sulfation, by sodium chlorate decreased the 
binding of a gpl20/CD4 complex to CCR5 + cells (Farzan et al . 
Cell 96:667, 1999). A number of prior reports had 
implicated a role for- sulfate moieties in HIV-1 entry. 
Several sulfated compounds, such as dextran sulfate, can 
inhibit HIV-1 entry by associating with CD4 or gpl2 0 
(Baeuerle and Huttner J. Cell Biol 105:2655, 1987; Baba et 
al. Proc. Natl. Acad. Sci . USA 85:6132, 1998). Sulfated 
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proteoglycans have been shewn to bind to HIV-1 gpl20 at or 
near its third variable (V3) loop, which also determines 
co-receptor usage (Roderiguez et al . J . Virol. 69:2233, 
1995; Hwang et al . Science 253:71, 1991). It is therefore 
5 conceivable that sulfo- tyrosines in the CCR5 Nt also 
interact with gpl20, increasing its affinity for CCR5 . The 
reduction in gpl20/CD4 binding caused by the pre- treatment 
of target cells with sodium chlorate, however, cannot be 
formally attributed to a reduction in CCR5 tyrosine 
10 sulfation since chlorate can inhibit the sulfation of both 
tyrosines and proteoglycans. 

The region of the CCR5 Nt spanning amino acids 2-18 
'contains residues that are critically important for viral 

15 entry (Dragic et al . J. Virol. 72:279, 1998 ; Rabut et al . 
J. Virol. 72:3464, 1998; Farzan et al . J. Virol. 72:1160, 
1998; Dorantz et al . J. Virol. 71:6305, 1997). We 

previously demonstrated that tyrosines at positions 3, 10 
and 14 were required for optimal co-receptor function, 

20 whereas the TyrlBPhe substitution had little effect on 
entry (Rabut et al . J. Virol. 72:3464, 1998). Taken 
together, these findings suggested that HIV-1 entry- may be 
critically dependent upon sulfation of Tyr-3, -10 and -14, 
but not Tyr-15. We therefore explored the role of sulfo- 

25 tyrosines in positions 3, 10 and 14 by synthesizing 
peptides corresponding to amino acids 2-18 of the CCR5 Nt 
and carrying different tyrosine modifications. We first 
tested the ability of the Nt peptides to inhibit binding of 
gpl20/CD4 complexes and anti-CCR5 MAbs to CCR5 + cells. The 

30 specific association of certain peptides with gp!20/sCD4 
complexes or with anti-CCR5 MAbs was further confirmed by 



surface plasmon resonance (EIAcore) analysis. Inhibition of 
HIV-1 entry by the CCR5 Nt peptides was also tested. Our 
results suggest that amino acids 2-18 of the CCR5 Nt 
compose a gpl2 0 -binding site that determines the 
specificity of the interaction between CCR5 and gp!20s from 
R5 and R5X4 isolates. Post - translational sulfation of the 
tyrosine residues in the CCR5 Nt is required for gp!2 0 
binding and may critically modulate the susceptibility of 
target cells to HIV-1 infection in vivo. 

CCR5 ' s normal physiologic activities involve binding and 
transducing signals mediated by CC-chemokines, including 
RANTES, MlP-la and MIP-1|5, which direct activation and 
trafficking of T cells and other inflammatory cells. As 
such, CCR5 plays an important role in mediating the 
inflammatory reaction of diseases such as rheumatoid 
arthritis and multiple sclerosis. The synovial fluid of 
rheumatoid arthritis patients is highly enriched in CCR5- 
expressing T cells (Qin et al . J Clin Invest 101:746, 
1998) , and CCR5 is the predominant CC chemokine receptor- 
expressed on T cells in the rheumatoid synovium (Gomez- 
Reino et al . Arthritis Rheum 42:989, 1999). Similarly, 
infiltration by CCR5-expressing cells is characteristic of 
plague lesions in patients with multiple schlerosis 
(Balashov et al . Proc Natl Acad Sci USA 96:6873, 1999). 
Such observations provide a rationale for the use of agents 
that block CCR5 for therapy of inflammatory /autoimmune 
diseases, including but not limited to arthritis, multiple 
sclerosis, asthma, psoriasis, autoimmune diabetes, 
transplant rejection, and atherosclerosis- 
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Summary of the Invention 

This invention provides a compound comprising the 
structure : 

0aYDINYYTSE3A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO : 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; wherein 6 represents an 
amino group or an acetylated amino group; wherein X 
represents a carboxyl group or an ami dated carboxyl group; 
wherein all cf a, Y , D, I , N, Y , Y , T, S , E and (3 are joined 
together by peptide bonds; further provided that at least 
two tyrosines in the compound are sulfated. 



This invention also provides a compound comprising the 
structure : 

6aYDINYYTSE(3A 

wherein each T represents a threonine, each S represents a 
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serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
5 with the proviso that if there are more than 2 amino acids, 
they are joined by peptide hcnas in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
10 represents from 0 to 333 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
15 in the carboxy terminal direction; 

wherein 0 represents an amino group or an acetylated amino 
group; wherein A represents a carboxyl group or an ami dated 
carbcxyl group; wherein all of a , Y, D , I , N, Y , Y, T , S , E and 3 
are joined together by peptide bonds; further provided that 
20 at least two tyrosines in the compound are sulfated. 

This invention provides a composition which comprises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
25 receptor on the surface of a CD4+ cell. 

This invention provides a method of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 
carries a CCR5 receptor on its surface which comprises 
30 contacting the CD4+ cell with an amount of one of the 
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compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 
the CD4 + cell. 

This invention provides a method of preventing CD4 + cells 
of a subject from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4+ cells so 
as to thereby prevent the subject's CD4+ cells from 
becoming infected with human immunodeficiency virus. 

This invention provides a method of treating a subject 
whose CD4+ cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
amount of one of the compounds described herein effective 
to inhibit binding of human immunodeficiency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 .receptor 
5 which comprises : 

(a) immobilizing one of the compounds described herein on 
a solid support; 

(b) contacting the immobilized compound from step (a) with 
sufficient detectable CCR5 ligand to saturate all 

,0 binding sites fox' the CCR5 ligand on the immobilized 



compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the agent ; 
and 

(e) detecting whether any CCR5 ligand is displaced from 
the complex, wherein displacement of detectable CCR5 
ligand from the complex indicates that the agent binds 
to the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is bound to 
the compound in the complex; 

(d) contacting the complex from step (a) with the agent so 
as to displace CCR5 ligand from the complex; 

(e) measuring the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in step 



(c) , wherein a reduced amount measured in step (e) 
indicates that the agent binds 'to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

^. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) immobilizing one of the compounds described herein on 
10 * a solid support; 

(b) contacting the immobilised compound from step (a) with 
the agent and sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on the 
compound under conditions permitting binding of the 

15 CCR5 ligand to the immobilized compound so as to form 

a complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand which 
is bound to the immobilized compound in the complex; 

20 (e) measuring the amount of detectable CCR5 ligand which 
binds to the immobilized compound in the absence of 
the agent; 

(f) comparing the amount of CCR5 ligand which is bound to 
the immobilized compound in step (e) with the amount 
25 measured in step (d) , wherein a reduced amount 

measured in step (d) indicates that the agent binds tc 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

30 



This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
i-eceptor which comprises: 

(a) contacting one of the compounds described herein with 
the agent and sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on the 
compound under conditions permitting binding of the 
CCR5 ligand to the compound so as to form a complex ; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand which 
is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comparing the amount of CCR5 ligand which is bound to 
the compound in step (c) with the amount measured in 
step (d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 
as to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

a) immobilizing one of the compounds described 



herein on a solid support; 



b) 



contacting the immobilized compound from step a) 
with the agent dissolved or suspended in a known 



vehicle 



and 



measurxng 



the 



binding 



signal 



generated by such contact; 



C ; 



contacting the immobilized compound from step 
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with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 
d) comparing the binding signal measured in step b) 
5 with the binding signal measured in step c) , 

wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

10 

This invention provides a method of obtaining a composition 
which comprises: 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 

15 above methods; and 

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 

This invention provides a compound having the structure: 
20 A - (aYDINYYTSE(3X) n 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each 1 represents an isoieucine; and each N represents an 

25 asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 

30 and extending therefrom in the amino terminal direction; 



wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with tne P at position 19 and extending 
therefrom in the carboxy terminal direction; whei^ein A 
represents a carboxyi group or an amidated carboxyl group; 
wherein all of a , Y , D , I , N , Y , Y , T , S , E and (B are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

This invention also provides a compound having the 
structure : 

(6aYDINYYTSE|3) n - A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: I beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
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have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group ; 
5 wherein all of a , Y , D , I,N,Y,Y,T,S,E and 3 are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8 , A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
10 parentheses to A. 

This invention provides a compound having the structure: 
A- (aYDINYYTSEpA) n 

wherein each T represents a threonine, each S represents a 

15 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 

20 they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to 333 amino acids, with the 

25 proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carbcxy terminal direction; wherein X 

30 represents a carboxyl group or an amidated carboxyl group; 



wherein all of a, Y, D, I,N / Y,Y,T, S,E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n is an integer from 1 to 8, 
A is a polymer, and the solid line represents up to 8 
linkers which attach the structure in parentheses to A . . 

This invention also provides a compound having the 
structure : 

( 9aYDINYYTSEp ) n — A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a . glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine,- and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 
and extending therefrom in the amino terminal direction ,- 
wherein £ represents from 0 to 333 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carbcxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group ; 
wherein all of a, Y, D, I , N, Y, Y, T, S , E and (3 are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 



integer from 1 to 8 , A is 
represents up to 8 linkers 
parentheses to A. 
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a polymer, and the solid line 
which attach the structure in 
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Erief Description of the Figures 

Fig. 1 Effect of peptides on gpl20 JR _ FL binding to CCR5 . 

L1.2-CCR5 4 cells were incubated with the 
5 biotinylated gpl2 G JR _ FL /CD4 - IaG2 complex in the 

presence of different concentration of peptides 
(a) S-3/10/14, £-10/14, S-10, S-14 or (b) P- 
3/10/14, P-10/14, SR-2/12, , SR-10/14, TS-10/14. 
The extent of "complex binding in the absence of 
10 peptide was defined as 100% (rn.f.i. ~40±5). 

Binding in the presence of peptide is expressed 
as a percentage of control. When CCR5-negative 
cells were used, binding of the gpl2 0jp.pl/CD4 - lgG2 
complex was negligible (-10%, rn.f.i. ~2±1). The 
15 values shown are from a representative 

experiment . 



Fig. 2 Einding of the gpl20/sCD4 complex to sulfated and 

phosphorylated peptides . 

20 Biotinylated peptides were immobilized on a 

sensor chip and their ability to associate with 
gpl20/sCD4 was analyzed by BIAcore . RU values as 
a function of time were measured in the absence 
of peptide (gray dotted lines) , in the presence 

25 of phosphorylated peptide (black dotted lines) or 

in the presence of sulfated peptide (solid black 
lines) . We performed binding analyses with the 
following proteins: (a) gpl2 0 JR . FL /sCD4 , (b) gpl20 JK . 
fl ' (c) sCD4 , (d) DV3gpl2 0 JR _ FL /sCD4 , (e) 

30 gpl20 DHi23 /sCD4 , (f) gpl20 

DK123 / (9) 9Pl2 0 laAI / sCD4 anc 
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(h) gpl20 1A1 . 

Fig. 3 Effect of peptides on MAb binding to CCR5 . 

L1.2-CCR5 4 cells were incubated with the anti-CCR5 
5 MAbs in the presence of peptides. The extent of 

MAb binding in the absence of peptide was defined 
as 100% (m.f.i. -50-400, depending on the MAb). 
Binding in the presence of peptide is expressed 
as a percentage of control . When CCR5 -negative 
10 cells were used, binding of MAbs was negligible 

(m.f.i. -2+1). Each data point represents the 
mean ± s.d. of three replicates. 

Fig. 4 Binding of MAbs to sulfated and phosphorylated 

15 peptides . 

Biotinylated peptides were immobilized on a 
sensor chip and their ability to associate with 
anti-CCR5 MAbs was analyzed by BIAcore . RU values 
as a function of time were measured in the 
20 absence of peptide (gray dotted lines) , in the 

presence of phosphorylated peptide (black dotted 
lines) or in the presence of sulfated peptide 
(solid black lines) . We performed binding 
analyses with (a) PAS, (b) PA10 and (c) 2D7 . 

25 

Fig. 5 Effect of peptides on viral entry. 

KeLa-CD4 + CCR5" f cells were infected with Nlluc 4 env 
pseudotyped with different viral envelopes in the 
presence of peptides. Luciferase activity 
30 (r.l.u.) was mesured 48 h post - infect ion . The 
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extent of entry in the absence of peptide was 
defined as 100% (r.l.u.. ~25,000± 9,000). 
Background r.l.u. values were -7+2. Each data 
point represents the mean ±. s.d. of three 
5 replicates. 

Fig- 6 CCR5 Nt peptide sequences and labels 

The primary sequence of each peptide is indicated 
in the left column and the corresponding label is 
10 indicated in the right column. Sulfated tyrosine 

residues are designated by black boxes and white 
boxes designate phosphorylated tyrosine residues . 

Fig. 7 Gpl2 0/CD4 complex binding to CCR5 Nt 

15 sulf opeptides 

Peptide 2-18 was bound to streptavidin- coated 
biosensor chips and gpl2 0jr.pl/sCD4 (dotted line) 
or gpl2 0 JR _ FL /CD4-IgG 2 (solid line) were flowed over 
the sensor chip surface. Resonance units (RU) 

20 were measured as a function of time using a ■ 

Biacore X and reflect complex-peptide binding 
(a). Sulfopeptide 2-18 (solid symbols) or 
phosphopeptide 2-18 (P) (clear symbols) were 
immobilized on streptavidin-coated ELISA plates 

25 and incubated with gpl2 0/CD4 - IgG 2 complexes. Gpl20 

proteins were derived from the R5 isolate JR-FL 
(squares) , the R5X4 isolate DH123 (circles) and 
the X4 isolate LAI (diamonds) . Complexes-peptide 
binding was detected by an HRP-conjugated goat 

30 ant i- human IgG antibody. O.D. at 45 0 nm was 
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measured after addition of HRP substrate and is 
expressed as a function of CD4-IgG 2 concentration 
(b) . Biotinylated sulfopeptide 2-18 was 
immobilized on streptavidin-coated plates and 
5 incubated with gpl2 0/CD4 - IgG : , complex in the 

presence of increasing concentrations of : PA 8 
(solid squares) , TAK-779 (triangles) , Rantes 
(inverted triangles) , MIP-1 (diamonds) , MIP-1 
(circles) or SDF-1 (clear squares) . Binding of 
10 the complexes to the peptide was detected by 

incubation with HRP-conjugated ; goat anti-human 
IgG antibody. O.D. at 450 nm was measured after 
addition of HRP substrate and percentage of 
binding was expressed as a function of inhibitor 
15 concentration. 

Fig. 8: Binding of anti-CCR5 MAhs to CCR5 Nt peptides. 

Sulf opeptides (a) or phosphopept ides (b) were 
immobilized on streptavidin-coated ELISA plates 

20 and incubated with anti-CCR5 MAbs PA 8 (solid 

squares) , PA10 (clear circles) , PA11 (solid 
circles) , PA12 (solid diamonds) or PA14 (solid 
triangles) . Binding of the MAbs to the peptides 
was detected by an HRP-conjugated goat anti-mouse 

25 IgG antibody. O.D. at 450 nm was measured after 

addition of HRP substrate and expressed as a 
function of MAb concentration. 



Fig. 9: 

30 



Binding of gpl2 0 JR . FI /CD4 - IgG 2 to different CCR5 Nt- 
based peptides.. 
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Streptavidin plates were coated with 2-18 (black 
squares) , 10-18 (black circles) , 8-15 (black 
diamonds) , 6-16 (black stars) / 10-15 (white 
square), 10-18 ( 11A/18A) (black triangles). Plates 
5 were then incubated with gpl2 0 JR _ FL /CD4 - IgG 2 

complex. Binding of the complex to the peptide 
was detected by an HRP-conj ugated goat ant i -human 
IgG antibody. O.D. at 450 nm was measured after 
addition of HRP substrate and expressed as a 
10 function of CD4-lgG 2 concentration (nM) . 

Fig. 10: Inhibition of gpl2 0/CD4 - IgG 2 complex binding to 
sulf o-peptides by anti-gpl20 MAbs 

Biotinylated sulfopeptide 2-18 was bound to 
streptavidin-coated biosensor chips and solutions 
of either gpl20 JR _ FL /CD4-IgG 2 complex (black bars) 
or gpl2 0 DH123 /CD4-IgG 2 complex (white bars) were 
flowed over the surface of the chip in the 
presence of different anti-gpl20 MAbs. The names 
of the MAbs and the location of their epitopes 
are indicated along the x-axis. Resonance units 
(RU) were measured as a function of time using a 
Biacore X and reflect complex-peptide binding in 
the presence of the MAbs. Gpl2 0/CD4-IgG 2 binding 
was calculated using the formula: (RU in the 
presence of MAbs) / (RU in the absence of MAbs) 
xl00%. The values shown are from a sample 
experiment . 

30 Fig. 11: Binding of gpl20 mutants to sulf o-peptide and 
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wild type CCR5 . 

Sulf o-peptide 2-18 was immobilized on 

streptavidin-coated plates and incubated with a 
mixture of gpl2 0 -containing supernatant s and CD4- 
5 IgG 2 . Pept ide-complex binding was detected by an 

HRP-conjugated goat anti-human IgG antibody. O.D. 
at 450 nm was measured after addition of HRP 
substrate and normalized for binding of the gpl2 0 
mutants to CD4-IgG 2 . The doted line represents the 

10 normalized value for the binding of the wild type 

gpl2 0 to the peptide. The mutated amino acids and 
their locations in gpl20 are indicated along the 
x-axis (a) . L12-CCR5 + cells were incubated with a 
mixture of gpl20-containing supernatants and CD4 - 

15 IgG 2 . Binding of the complex was detected by FACS 

analysis after addition of streptavidin- PE . 
Percentage of gpl2 0/CD4 - lgG 2 binding to CCR5 was 
normalized for gpl2 0 binding to CD4-IgG 2 . The 
doted line represents the normalized value for 

20 the binding of wild-type gpl20 to the L12-CCR5+ 

cells. The mutated amine acids and their 
locations in gpl20 are indicated along the x-axis 
(b) . 

25 

Fig. 12: Amino acid sequences of CCR5 Nt-based peptides. 

The peptides are named according to the positions 
of their first and last residues in the full- 
length sequence of CCR5 . They contain either 
30 sulf otyrosines (black boxes) or phosphotyrosines 
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(white boxes) n positions 10 and 14. Residues 
Asp-11 and Glu-18 in peptide 10-18 (11A/18A) are 
substituted for alanines. All peptides carry a 
carboxy terminal GAG spacer followed by a 
5 biotinylated lysine. 

Fig. 13: Amino acid conservation among R5 isolates. 

Envelope sequences from 25 R5 strains described 
in the HIV Database and retrieved from the 

10 National Center for Biotechnology Information 

GenBank were aligned and percentage of 
conservation for the indicated residues was 
calculated and combined with results from Hung et 
al . , 19 99 (REF) . Alanine mutants showing more 

15 than 50 % decrease in sulfopeptide 2-18 binding 

compared to the wild type are highlighted in 
gray . 
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Detailed Description of the Invention 

The plasmids CD4 - IgG2-HC-pRcCMV and CD4 -kLC-pRcCMV were 
deposited pursuant to, and in satisfaction of, the 
requirements of the Budapest Treaty on the International 
5 Recognition of the Deposit of Microorganisms (the "Budapest 
Treaty") for the Purposes of Patent Procedure with the 
American Type Culture Collection (ATCC) , 10801 University 
Boulevard, Manassas, Virginia 20110-2209 under ATCC 
Accession Nos . 75193 and 75194, respectively. 

10 

The plasmids designated PPI4-tPA-gpl20 JR | FL and PPI4-tPA- 
gpl20 LAI were deposited pursuant to, and in satisfaction of, 
the requirements of the Budapest Treaty on the 
International Recognition of the Deposit : of Microorganisms 

15 for the Purposes of Patent Procedure with the American Type 
Culture Collection (ATCC) , 10801 University Boulevard, 
Manassas, Virginia 20110-2209 under ATCC Accession Nos. 
75431 and 75432, respectively. These plasmids were 

deposited with ATCC on March 12, 1993. These eukaryotic 

20 shuttle vectors contain the cytomegalovirus major 
immediate-early (CMV MIE) promoter/enhancer linked to the 
full-length HIV-1 envelope gene whose signal sequence was 
replaced with that derived from tissue plasminogen 
activator. In the vector, a stop codon has been placed at 

25 the gpl20 C-terminus to prevent translation of gp41 
sequences, which are present in the vector. The vector 
also contains an ampicillin resistance gene, an SV40 origin 
of replication and a DHFR gene whose transcription is 
driven by the p-globin promoter. 



30 
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The monoclonal antibodies PA8 , PA10, PA11, PA12, and PA14 
were deposited pursuant to and in satisfaction of, the 
requirements of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
5 Purposes of Patent Procedure with the American Type Culture 
Collection (ATCC) , 10801 University Boulevard, Manassas, 
Virginia 20110-2209 on December 2, 1998 under the following 
Accession Nos. : ATCC Accession No. KB-12605 (PA8), ATCC 
Accession No.HB-12607 (PA10) , ATCC Accession No. HB-12608 
10 (PAID, ATCC Accession No. HB-12609 (PA12), and ATCC 
Accession No. HB-12610 (PA14) . 

As used herein, the following standard abbreviations are 
, used throughout the specification to indicate specific 
15 amino acids: 



A=ala=alanine R=arg=arginine 
N=asn=asparagine D=asp=aspartic acid 

C=cys= cysteine Q=gln=glut amine 

20 E=glu=glutamic acid G=gly-glycine 

H=his=histidine I=ile=isoleucine 
L=leu= leucine K=lys= lysine 

M=met=methionine F=phe=phenyl alanine 

P=pro=proline S=ser= serine 

25 T=thr= threonine W=trp=tryptophan 

Y=tyr= tyrosine V=val=valine 
B=asx=asparagine or aspartic acid 
Z=glx=glut amine or glutamic acid 



30 As used herein, the following standard abbreviations are 
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used throughout the specification to indicate specific 
nucleotides: C=cytosine; A=adenosine; T=thymidine; 
G= guanosine; and U=uracil . 

5 This invention provides a compound comprising the 
structure : 

GaYDINYYTSEPA 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 

10 represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 ,to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 

15 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein (3 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 

20 peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO : 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
25 group; wherein X represents a carboxyl group or an amidated 
carboxyl group; wherein all of a , Y , D, I , N, Y , Y , T, S , E and (5 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

30 In one embodiment of the above compound, the compound is 
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peptide which comprises consecutive amino acids having the 
sequence YDINYYTSE . 

In one embodiment of the above compound, the tyrosines at 
5 positions 1 and 5 of the sequence YDINYYTSE are sulfated. 



In one embodiment of the above compound, Of represents less 
than 9 amino acids. In another embodiment of the above 
compound, a represents less than 8 amino acids. In another 

10 embodiment of the above compound, a represents less than 7 
amino acids. In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a represents less than 5 amino 
acids . In another embodiment of the above compound, a 

15 represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 amino acids. In another embodiment 
of the above compound, a represents less than 1 amino acid. 

20 

In one embodiment of the above compound, (3 represents less 
than 17 amino acids. In one embodiment of the above 
compound, (3 represents less than 16 amino acids.. In one 
embodiment of the above compound, (3 represents less than 15 

25 amino acids. In one embodiment of the above compound, (3 
represents less than 14 amino acids. In one embodiment of 
the above compound, [3 represents less than 13 amino acids. 
In one embodiment of the above compound, (3 represents less 
than 12 amino acids. In one embodiment of the above 

30 compound, (3 represents less than 11 amino acids. In one 
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embodiment of the above compound, 3 represents less than 10 
amino acids. In one embodiment of the above compound, 3 
represents less than 9 amino acids. In one embodiment of 
the above compound, (3 represents less than 8 amino acids. 
5 ■ In one embodiment of the above compound, 3 represents less 
than 7 amino acids . In one embodiment of the above 
compound, 3 represents less than. 6 amino acids. In one 
embodiment of the above compound, (3 represents less than 5 
amino acids. In one embodiment of the above compound, 3 
10 represents less than 4 amino acids. In one embodiment of 
the above compound, 3 represents less than 3 amino acids. 
In one embodiment of the above compound, 3 represents less 
than 2 amino acids . In one embodiment of the above 
compound, (3 represents less than 1 amino acid. 

15 

This invention also provides a compound comprising the 
structure : 

6 a YD I N Y YT S E 3 A 

wherein each T represents a threonine, each S represents a 
20 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
25 they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
represents from 0 to 333 amino acids, with the proviso that 
30 if there are more than 2 amino acids, they are joined by 
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peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 
5 wherein 9 represents an amino group or an acetylated amino 
group; wherein X represents a carboxyl group or an amidated 
carboxyl group; wherein all of a, Y , D, I , N, Y, Y, T, S , E and 3 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

10 

In the compounds described herein and as 1 exemplified above, 
the 3 in each compound may alternatively represent from 0 
to 333 amino acids. 

15 In one embodiment of the compounds described herein, 3 
represents less than 300 amino acids. In another embodiment 
of the above compound, 3 represents less than 250 amino 
acids. In another embodiment of the above compound, 3 
represents less than 200 amino acids. In another embodiment 

20 of the above compound, (3 represents less than 150 amino 
acids. In another embodiment of the above compound, 3 
represents less than 100 amino acids. In another embodiment 
of the above compound, 3 represents less than 75 amino 
acids. In another embodiment of the above compound, 3 

25 represents less than 50 amino acids. In another embodiment 
of the above compound, 3 represents less than 4 0 amino 
acids. In another embodiment of the above compound, (3 
represents less than 35 amino acids. In another embodiment 
of the above' compound, 3 represents less than 3'0 amino 

30 acids. In another embodiment of the above compound, 3 



represents less than 25 amino acids. In another embodiment 
of the above compound, 3 represents less than 20 amino 
acids. In another embodiment of the above compound, 3 
represents less than 19 amino acids. In another embodiment 
of the above compound, (5 represents less than 18 amino 
acids. In another embodiment of the above compound, (3 
represents less than 17 amino acids. In another embodiment 
of the above compound, 3 represents less than 16 amino 
acids. In another embodiment of the above compound, 3 
represents less than 15 amino acids. In another embodiment 
of the above compound, 3 represents less than 14 amino 
acids. In another embodiment of the above compound, 3 
represents less than 13 amino acids. In another embodiment 
of the above compound, 3 represents less than 12 amino 
acids. In another embodiment of the above compound, 3 
represents less than 11 amino acids. 

In one embodiment of the above compound, a represents less 
than 9 amino acids. In another embodiment of the above 
compound, a represents less than 8 amino acids. In another 
embodiment of the above compound, a represents less than 7 
amino acids. In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a represents less than 5 amino 
acids. In another embodiment of the above compound, a 
represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 amino acids. In another embodiment 
of the above compound, a represents less than 1 amino acid. 
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The CCR5 amino acid sequence is the following and is set 
forth in SEQ ID NO:l: 

1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLV 
41 FIFGFVGNMLVILILINCKRLKSMTDIYLLNLAISDLFFL 
81 LTVPFWAHYAAAQWDFGNTMCOLLTGLYFIGFFSGIFFII 
121 LLTIDRYLAWHAVFALKARTVTFGWTSVITWWAVFAS 
161 LPGI I FTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKI VI 

2 01 LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI 
241 MIVYFLFWAPYNIVLLLNTFQEFFGLiNNCSSSNRLDQAMQ 

2 81 VTETLGMTHCCINPI IYAFVGEKFRNYLLVFFQKHIAKRF 

3 21 CKCCS I FQQEAPERAS SVYTRSTGEQE I S VGL 3 52 



The CCR5 nucleotide sequence is the following and is set 
forth in SEQ ID NO: 2: 



1 


GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 


61 


TCCCTTCACT 


ACAAAACTTC 


ATTGCTTGGC 


CAAAAAGAGA 


GTTAATTCAA 


TG TAG AC AT C 


121 


TATGTAGGCA 


ATTAAAAACC 


TATTGATGTA 


TAAAACAGTT 


TGCATTCATG 


GAGGGCAACT 


181 


AAATACATTC 


TAGGACTTTA 


TAAAAG AT C A 


CTTTTTATTT 


ATGCACAGGG 


TGGAACAAGA 


241 


TG G ATT AT C A 


AGTGTCAAGT 


CCAATCTATG 


ACATCAATTA 


TTATACATCG 


GAGCCCTGCC 


301 


AAAAAATCAA 


TGTGAAGCAA 


ATCGCAGCCC 


GCCTCCTGCC 


TCCGCTCTAC 


TCACTGGTGT 


361 


TCATCTTTGG TTTTGTGGGC AACATGCTGG TCATCCTCAT CCTGATAAAC TGCAAAAGGC 


421 


TG AAG AG CAT 


G AC TG AC AT C 


TACCTGCTCA 


ACCTGGCCAT 


CTCTGACCTG 


TTTTTCCTTC 


481 


TTACTGTCCC 


CTTCTGGGCT 


CACTATGCTG 


CCGCCCAGTG 


GGACTTTGGA . 


AATACAATGT 


541 


GTCAACTCTT 


GACAGGGCTC 


TATTTTATAG 


GCTTCTTCTC 


TGGAATCTTC 


TTCATCATCC 


601 


TCCTGACAAT 


CGATAGGTAC 


CTGGCTGTCG 


TCCATGCTGT 


GTTTGCTTTA 


AAAGC CAGGA 


661 


CGGTCACCTT 


TGGGGTGGTG 


ACAAGTGTGA 


TCACTTGGGT 


GGTGGCTGTG 


TTTGCGTCTC 


721 


TCC CAGGAAT 


CATCTTTACC 


AGATCTCAAA 


AAGAAGGTCT 


TCATTACACC 


TGCAGCTCTC 


781 


ATTTTCCATA 


CAGTCAGTAT 


CAATTCTGGA 


AGAATTTCCA 


GACATTAAAG 


ATAG T CATC T 


841 


TGGGGCTGGT 


CCTGCCGCTG 


CTTGTCATGG 


TCATCTGCTA 


CTCGGGAATC 


CTAAAAACTC 


901 


TGCTTCGGTG 


TCGAAATGAG 


AAG AAGAGG C 


ACAGGGCTGT 


GAGGCTTATC 


TTCACCATCA 


961 


TGATTGTTTA 


TTTTCTCTTC 


TGGGCTCCCT 


ACAACATTGT 


CCTTCTCCTG 


AACACCTTCC 


1021 


AGGAATTCTT 


TGGCCTGAAT 


AATTGCAGTA 


GCTCTAACAG 


GTTGGACCAA 


GCTATGCAGG 


1081 


TGACAGAGAC 


TCTTGGGATG 


ACGCACTGCT 


GCATCAACCC 


CATC AT CTAT 


GCCTTTGTCG 


1141 


GGGAGAAGTT 


CAGAAACTAC 


CTCTTAGTCT 


TCTTCCAAAA 


GCACATTGCC 


AAACGCTTCT 



12 01 GCAAATGCTG TTCTATTTTC CAGCAAGAGG CTCCCGAGCG AGCAAGCTCA GTTTACACCC 
12 61 GATCCACTGG GGAGCAGGAA ATATCTGTGG GCTTGTGACA CGGACTCAAG TGGGCTGGTG 
1321 ACCCAGTCAG AGTTGTGCAC ATGGCTTAGT TTTCATACAC AGCCTGGGCT GGGGGT 

The YDINYYTSE sequence corresponds to amino acid residues 
10-18 of the CCR5 sequence set forth above. 

As used herein, "CCR5" is a chemokine receptor which binds 
members of the CC group of chemokines and whose amino acid 
sequence comprises that provided in Genbank Accession 
Number 1705896 and related polymorphic variants. The 
nucleotide sequence comprises that provided in Genbank 
Accession Number X91492. In one embodiment, the above 
compound may correspond to the extracellular portion of 
CCR5 . The first 31 amino acids of CCR5 correspond to the 
extracellular portion of CCR5 . Accordingly, the 
extracellular portion extends from the methionine at 
position number 1 to the arginine at position number 31 of 
SEQ ID NO : 1 . In another embodiment, the above compound may 
correspond to the amino- terminal portion of. CCR5 . As used 
herein, "N- terminus" or amino -terminus means the sequence of 
amino acids spanning the initiating methionine and the 
first transmembrane region. 

As used herein, "H 2 N" refers to the N-terminus or amino- 
terminus. As used herein, "COOH" refers to the C-terminus 
or carboxy- terminus . 

Various tyrosines of the compounds described herein may be 
sulfated. These include but are not limited to the 
tyrosines at positions 3, 10 and 14 of amino acid sequence 
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set forth in SEQ ID NO : 1 . Accordingly, in one embodiment, 
the tyrosines at positions 10 and .14 are sulfated. In 
another embodiment, the tyrosines at positions 3 and 14 are 
sulfated. In another embodiment, the tyrosines at positions 
5 3 and 10 are sulfated. In another embodiment, the tyrosines 
at positions 3, 10 and 14 are sulfated. Other tyrosines in 
the sequence set forth in SEQ ID NO : 1 may also be sulfated. 

This invention provides a composition comprising one of the 
10 compounds described, herein and a detectable marker attached 
thereto. In one embodiment of the i composition, the 
detectable marker is biotin. In one embodiment of the 
composition, the detectable marker is attached at the C- 
terminus of the compound. 

15 

The compounds of the subject invention may also be isolated 
or purified. In one embodiment the compound is labeled with 
a detectable marker. As used herein, chemical "labels" 
include radioactive isotopes, fluorescent groups and 
20 affinity moieties such as biotin that facilitate detection 
of the labeled peptide. Other chemical labels are well- 
known to those skilled in the art. Methods for attaching 
chemical labels to peptides are well-known to the skilled 
artisan. 

25 

As used herein, "peptide" and "polypeptide" are used to 
denote two or more amino acids linked by a peptidic bond 
between the a-carboxyl group of one amino acid and the oc- 
amino group of the next amino acid. Peptides may be 
30 produced by solid-phase synthetic methods that are well- 



known to those skilled in the art. In addition to the above 
set of twenty amino acids that are used for protein 
synthesis in vivo, peptides may contain additional amino 
acids, including but not limited to hydroxyproline , 
sarcosine, and ycarboxyglutamate . The peptides may contain 
modifying groups including but not limited to sulfate and 
phosphate moieties. Peptides can be comprised of L- or D- 
amino acids, which are mirror- image forms with differing 
optical properties.- Peptides containing D-amino acids have 
the advantage of being less susceptible to proteolysis in 
vivo . 

Peptides may by synthesized in monomeric linear form, 
cyclized form or as oligomers such as branched multiple, 
antigen peptide (MAP) dendrimers (Tarn et al . Biopolymers 
51:311, 1999). Nonlinear peptides may have increased 
binding affinity by virtue of their restricted 
conformations and/or oligomeric nature. Peptides may also 
be produced using recombinant methods as either isolated 
peptides or as a portion of a larger fusion protein that 
contains additional amino acid sequences. 

Peptides may be chemically conjugated to proteins by a 
variety of . well-known methods. Such peptide-protein 
conjugates can be formulated with a suitable adjuvant and 
administered parenterally for the purposes of generating 
polyclonal and monoclonal antibodies to the peptides of 
interest. Alternatively, unconjugated peptides can be 
formulated with adjuvant and administered to laboratory 
animals for the purposes of generating antibodies. Methods 
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for generating and isolating such antibodies are well-known 
to those skilled in the art. 

This invention provides derivatives of the above compound. 
5 As used herein, a "derivative" peptide is one whose amino 
acid sequence is nonidentical to the reference peptide but 
which possesses functionally similar binding properties. 
Derivative peptides may also contain N-terminal, C-terminal 
and/or internal insertions, deletions, or substitutions of 

10 amino acids, with the proviso that such insertions, 
deletions and substitutions do not abrogate the binding 
properties of the peptide. Derivative ( peptides include 
peptides modified with chemical labels to facilitate 
detection. Derivative peptides include branched and 

15 cyclized peptides. 

As used herein, "sulf opept ides" are peptides that contain 
sulfate moieties attached to one or more amino acids, such 
as tyrosine. In "sulf o-tyrosines" , a sulfate group replaces 
20 the para- hydroxy 1 group located on tyrosine side-chain. 

As used herein,' "phosphopept ides" are peptides that contain 
phosphate moieties attached to one or more amino acids, 
such a tyrosine. In "phospho- tyrosines" , a phosphate group 
25 replaces the para- hydroxy 1 group located on tyrosine side- 
chain. - 

The peptides of the subject invention may be sulfated when 
synthesized or they may be subsequently sulfated. For 
30 example, means of sulfating the peptides include chemical 



sulfation or enzymatic sulfation. One skilled in the art 
would know how to employ these and other techniques to 
sulfate the compound. 

This invention provides a composition which comprises a 
carrier and an amount of one of the compounds ' described 
herein effective to inhibit binding of HIV-1 to a CCR5 
receptor on the surface of a CD4+ cell. 

The carriers include but are not limited to an aerosol, 
intravenous, oral or topical carrier. Accordingly. The 
invention provides the above composition adapted for 
aerosol, intravenous, oral or topical application. 

This invention provides the above compositions and a 
pharmaceutically acceptable carrier. Pharmaceut ically 

acceptable carriers are well known to those skilled in the 
art. Such pharmaceutically acceptable carriers may include 
but are not limited to aqueous or non- aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous 

solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate. Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions or 

suspensions, saline and buffered media. Parenteral 
vehicles include sodium chloride solution, Ringer's 
dextrose, dextrose and sodium chloride, lactated Ringer's 
or fixed oils. Intravenous vehicles include fluid and 
nutrient replenishers , electrolyte replenishers such as 
those based on Ringer's dextrose, and the like. 
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Preservatives and other additives may also be present, such 
as, for example, antimicrobials, antioxidants, chelating 
agents, inert gases and the like. 

5 As used herein, "composition" means a mixture. The 
compositions include but are not limited to those suitable 
for oral, rectal, intravaginal , topical, nasal, opthalmic, 
or parenteral administration to a subject. As used herein, 
"parenteral" includes but is not limited to subcutaneous, 
10 intravenous, intramuscular, or intrasternal injections or 
infusion techniques . j 

l 

As used herein, "administering" may be effected or performed 
using any of the methods known to one: skilled in the art. 

15 The methods may comprise intravenous, intramuscular or 
subcutaneous means. As used herein, "effective dose 1 ' means 
an amount in sufficient quantities to either treat the 
subject or prevent the subject from becoming infected with 
HIV-1. A person of ordinary skill in the art can perform 

20 simple titration experiments to determine what amount is 
required to treat the subject. 

This invention provides a method of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 

25 carries a CCR5 receptor on its surface which comprises 
contacting the CD4+ cell with an amount of one of the 
compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 

30 the CD4+ cell. As used herein, "inhibits" means that the 



amount is reduced. In a preferred embodiment, inhibits 
means that the amount is reduced 10 0%. 

In one embodiment of this method, the CD4+ cell is present 
in a subject and the contacting is effected by- 
administering the compound to the subject. 

This invention provides a method of preventing CD4 + cells 
of a subject from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4+ cells so 
as to thereby prevent the subject's CD4 + cells from 
becoming infected with human immunodeficiency virus. 

This , invention provides a method of treating a subject 
whose CD4+ cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
amount of one of the compounds described herein effective, 
to inhibit binding of human immunodeficiency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

As used herein, human immunodeficiency virus includes but 
is not limited to HIV-l, which is the human 
immunodeficiency virus type-1. HIV-1 includes but is not 
limited to extracellular virus particles and the forms of 
HIV-1 found in HIV-1 infected cells. 
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As used herein, "HIV-1 infection" means the introduction of 
HIV-1 genetic information into a target cell, such as by 
fusion of the target cell membrane with HIV-1 or an HIV-1 
envelope glycoprotein* cell. The target cell may be a bodily 
5 cell of a subject. In the preferred embodiment, the target 
cell is a bodily cell from a human subject. 

As used herein, "inhibiting HIV-1 infection" means the 
reduction of the amount of HIV-1 genetic information 
10 introduced into a target cell population as compared to the 
amount that would be introduced without the composition. 

i 

In the above methods, the compound may be administered by 
various routes including but not limited to aerosol, 

15 intravenous, oral or topical route. The administration may 
comprise intralesional , intraperitoneal, intramuscular or 
intravenous injection; infusion; liposome-mediated 

delivery; topical, intrathecal, gingival pocket, per 
rectum, intrabronchial , nasal, oral, ocular or otic 

20 delivery. In a further embodiment, the administration 
includes intrabronchial administration, anal, intrathecal 
administration or transdermal delivery. In another 
embodiment, the compound is administered hourly, daily, 
weekly, monthly or annually. In another embodiment, the 

25 effective amount of the compound comprises from about 
0.0 00001 mg/kg body weight to about 100 mg/kg body weight. 

The administration may be constant for a certain period of 
time or periodic and at specific intervals. The compound 
30 may be delivered hourly, daily, weekly, monthly, yearly 
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(e.g. in a time release form) or as a one time delivery. 
The delivery may be continuous delivery for a period of 
time, e.g. intravenous delivery. 

5 The carrier may be a diluent, an aerosol,- a topical 
carrier, an aqeuous solution, a nonaqueous solution or a 
solid carrier. 

■The effective amount of the compound may comprise from 

10 about 0.000001 mg/kg body weight to about 100 mg/kg body 
weight. In one embodiment, the effective amount may 
comprise from about 0.001 mg/kg body weight to about >50 
™g/kg body weight. In another embodiment, the effective 
amount may range from about 0.01 mg/kg body weight "to about 

15 10 mg/kg body weight. The actual effective amount will be 
based upon the size of the compound, the biodegradability 
of the compound, the bioactivity of the compound and the 
bioavailability of the compound. If the compound does not 
degrade quickly, is bioavailable and highly active, a 

20 smaller amount will be required to be effective. The 
effective amount will be known to one of skill in the art; 
it will also be dependent upon the form of the' compound, 
the size of the compound and the bioactivity of the 
compound. One of skill in the art could routinely perform 

25 empirical activity tests for a compound to determine the 
bioactivity in bioassays and thus determine the effective 
amount . 

The compound of the present invention may be delivered 
30 locally via a capsule which allows sustained release of the 
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agent or the peptide over a period of time. Controlled or 
sustained release compositions include formulation in 
lipophilic depots (e.g., fatty acids, waxes, oils). Also 
comprehended by the invention are particulate compositions 
5 coated with polymers (e.g., poloxamers or poloxamines) and 
the agent coupled to antibodies directed against tissue- 
specific receptors, ligands or antigens or coupled to 
ligands of tissue- specif ic receptors. Other embodiments of 
the compositions of the invention incorporate particulate 
10 forms protective coatings, protease inhibitors or 
permeation enhancers for various routes of administration, 
including parenteral, pulmonary, nasal and oral. 

In one embodiment of the above methods, the subject is 
15 infected with HIV-1 prior to administering the compound to 
the subject. In one embodiment of the above methods, the 
subject is not infected with HIV-1 prior to administering 
the compound to the subject. In one embodiment of the above 
methods, the subject is not infected with, but has been 
20 exposed to, human immunodeficiency virus. 

In one embodiment of the above methods, the effective 
amount of the compound comprises from about 1.0 ng/kg to 
about 100 mg/kg body weight of the subject. In another 

25 embodiment of the above methods, the effective amount of 
the compound comprises from about 100 ng/kg to about 5 0 
™Ef/kg body weight of the subject. In another embodiment of 
the above methods, the effective amount of the compound 
comprises from about 1 /ig/kg to about 10 mg/kg body weight 

30 of the subject. In another embodiment of the above methods, 
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the effective amount of the compound comprises from about 
100 M9/ k 9 to about 1 mg/kg body weight of the subject. 

The dose of the composition of the invention will vary 
5 depending on the subject and upon the particular route of 
administration used. Dosages can range from 0.1 to 100,000 
/zg/kg. Based upon the composition, the dose can be 
delivered continuously, such as by continuous pump, or at 
periodic intervals. For example, on one or more separate 
10 occasions. Desired time intervals of multiple doses of a 
particular composition can be determined without undue 
experimentation by one skilled in the art. 

As used herein, "effective dose" means an amount in 
sufficient quantities to either treat the subject or 
prevent the subject from becoming infected with HIV-1. A 
person of ordinary skill in the art can perform simple 
titration experiments to determine what amount is required 
to treat the subject. 

In one embodiment of the above method, the subject is a 
human being. As used herein, "subject" means any animal or 
artificially modified animal capable of becoming HIV- 
infected. Artificially modified animals include, but are 
not limited to, SCID mice with human immune systems. The 
subjects include but are not limited to mice, rats, dogs, 
guinea pigs, ferrets, rabbits, and primates. In the 
preferred embodiment, the subject is a human being. 

30 This invention provides a vaccine which comprises the 



15 



20 



25 
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compound described herein. Vaccines comprising the 
sulf ©peptides and a suitable adjuvant could be administered 
to a subject for the purposes of generating antibodies or 
other immune responses that are* of therapeutic or 
5 prophylactic value. For example, the vaccines could be 
administered for the purpose of generating in the subject 
antibodies that bind CCR5 and inhibit its ability to 
mediate HIV entry and infection, thereby protecting the 
subject from HIV infection or disease progression. The 
10 vaccines may also comprise a suitable adjuvant. The vaccine 
may also comprises a suitable carrier. j 

The subject invention has various applications which 
includes HIV treatment such as treating a subject who has 

15 become afflicted with HIV. As used herein, "afflicted with 
HIV-l" means that the subject has at least one cell which 
has been infected by HIV-l. As used herein, "treating" 
means either slowing, stopping or reversing the progression 
of an HIV-l disorder. In the preferred embodiment, 

20 "treating" means reversing the progression to the point of 
eliminating the disorder. As used herein, "treating" also 
means the reduction of the number of viral infections, 
reduction of the number of infectious viral particles, 
reduction of the number of virally infected cells, or the 

25 amelioration of symptoms associated with HIV-l. Another 
application of the subject invention is to prevent a 
subject from contracting HIV. As used herein, "contracting 
HIV-l" means becoming infected with HIV-l, whose genetic 
information replicates in and/or incorporates into the host 

30 cells. Another application of the subject invention is to 
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treat a subject who has become infected with HIV-1. As used 
herein, "HIV-l infection" means the introduction of HIV-1 
genetic information into a target cell, such as by fusion 
of the target cell membrane with HIV-1 or an HIV-1 envelope 
5 glycoprotein- cell. The target cell may be a bodily cell of 
a subject. In the preferred embodiment, the target cell is 
a bodily cell from a human subject. Another application of 
the subject invention is to inhibit HIV-1 infection. As 
used herein, "inhibiting HIV-1 infection" means reducing 
10 the amount of HIV-1 genetic information introduced into a 
target cell population as compared to the amount that would 
be introduced without said composition. 

This invention provides a method of identifying an agent 
15 which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) immobilizing one of the compounds described herein on 
a solid support; 

(b) contacting the immobilized compound from step (a) with 
20 sufficient detectable CCR5 ligand to saturate all 

binding sites for the CCR5 ligand on the immobilized 
compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 

25 (c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the agent; 
and 

( e ) detecting whether any CCR5 ligand is displaced from 
the complex, wherein displacement of detectable CCR5 

30 ligand from the complex indicates that the agent binds 
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to the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex; - 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand: which is bound to 
the compound in the complex; 

(d) contacting the complex, from step (a) with the agent so 
as to displace CCR5 ligand from the complex; 

(e) measuring . the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in step 
(c) , wherein a reduced amount measured in step (e) 
indicates that the agent binds to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) immobilizing one of the compounds described herein on 
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a solid support ; 

contacting the immobilized compound from step (a) with 
the agent and detectable CCR5 ligand -under conditions 
permitting binding of the CCR5 .• ligand to the 
immobilized compound so as to form a complex; 
removing any unbound CCR5 ligand; 

measuring the amount of detectable CCR5 ligand which 
is bound to the immobilized compound in the complex ; 
measuring the amount of detectable CCR5 ligand which 
binds to the immobilized compound in the absence of 
the agent ; 

comparing the amount of CCR5 ligand which is bound to 
the immobilized compound in step (e) with the amount 
measured in step (d) , wherein a reduced amount 
measure d in step (d) indicates that the agent binds to 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

20 In one embodiment of the above method, the amount of the 
detectable CCR5 ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
. ligand on the compound. 

25 This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) contacting one of the compounds described herein with 
the agent and detectable CCR5 ligand under conditions 
30 permitting binding of the CCR5 ligand to the compound 



10 



(b) 



(c) 
(d) 

(e) 



(f ) 



15 



so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand which 
is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comparing the amount of CCR5 . ligand which is bound to 
the compound in step (c) with the amount measured in 
step <d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 

as to thereby identify the agent as one which inhibits 

i 

binding of the CCR5 ligand to the CCR5 receptor. 

i 

In one embodiment of the above method, the amount of the 
detectable CCR5 .ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

In one embodiment of the above method the solid support is 
a microtiter plate well. In another embodiment, the solid 
support is a bead. In a further embodiment, the solid 
support is a surface plasmon resonance sensor chip. The 
surface plasmon resonance sensor chip can have pre- 
immobilized streptavidin . In ' one embodiment, the surface 
plasmon resonance sensor chip is a BIAcore™ chip. 

In one embodiment of the above methods, the detectable CCR5 
ligand is labeled with a detectable marker. In another 
embodiment of the above methods, the CCR5 ligand is 
detected by contacting it with another compound which is 
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both capable of detecting the CCR5 ligand and is 
detectable. The detectable markers include those described 
above . 

5 This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises : 

a) immobilizing one of the compounds described 
herein on a solid support; 
1Q b) contacting the immobilized compound from step a) 

with the agent dissolved or suspended in a known 
vehicle and measuring the . binding signal 
generated by such contact; 

c) contacting the immobilized compound from step a) 
15 with the known vehicle in the absence of the 

compound and measuring the binding signal 
generated by such contact; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c) , 

20 wherein an increased amount measured in step b) 

indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

25 In one embodiment of the above method, the solid support is 
a surface plasmon resonance sensor chip. In another 
embodiment of the above method, the binding signal is 
measured by surface plasmon resonance. 



30 This invention provides a method of obtaining a composition 
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which comprises: 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 
above methods ; and 
5 (b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 



The invention provides agents identified in the screen. 
Such agents may have utility in treating HIV-1 infection or 
10 other CCR5 -mediated diseases, which include rheumatoid 
arthritis, asthma, multiple sclerosis, psoriasis, 
atherosclerosis and other inflammatory diseases. 

In one embodiment of the above methods, the CCR5 ligand is 
15 a complex comprising an HIV-1 envelope glycoprotein and a 
CD4 -based protein. The HIV-1 envelope glycoproteins include 
but are not limited to gpl2 0, gpl40 or gpl60. The CD4 -based 
proteins include but are not limited to soluble CD4 or CD4- 
IgG2. 

20 

As used herein, "CD 4 n means the mature, native, membrane- 
bound CD4 protein comprising a cytoplasmic domain, a 
hydrophobic transmembrane domain, and an extracellular 
domain that binds to the HIV-1 gpl20 envelope glycoprotein. 

25 As used herein, "HIV-1 envelope glycoprotein" means the HIV- 
1 encoded protein which comprises the gpl2 0 surface 
protein, the gp41 transmembrane protein and oligomers and 
precursors thereof. As used herein, "CD4-based protein" 
means any protein comprising at least one sequence of amino 

30 acid residues corresponding to that portion of CD4 which is 
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required for CD4 to form a complex with the HIV-1 gpl2 0 
envelope glycoprotein. As used herein, "CD4-IgG2" means a 
heterotetrameric CD4 -human IgG2 fusion protein encoded by 
the expression vectors deposited under ATCC Accession 
5 Numbers 75193 and 75194. 

In one embodiment of the above methods, the CCR5 ligand is 
a chemokine . The chemokines include but are not limited to 
RANTES, MlP-la or MIP-1(3. As used herein, "RANTES", "MlP-la", 

10 and "MIP-1(3" denote members of the chemokine superfamily of 
proteins that direct the activation and migration of 
leukocytes and other cells involved in the inflammation. 
RANTES, MIP-lcx and MIP-lp are known to bind CCR5 and induce 
signaling. Their peptide sequences have been described 

15 (Wells et al. J. Leukocyte Biology, 59:53-60, 1996). 

In one embodiment of the above methods, the CCR5 ligand is 
an antibody. In one embodiment, the antibody is PA8 (ATCC 
Accession No. HB-12 605) . In another embodiment, the 
20 antibody is PA10 (ATCC Accession No. 12607). In another 
embodiment, the antibody is PA11 (ATCC Accession No. HB- 
12 6 08) . In another embodiment, the antibody is PA12 (ATCC 
Accession No. HB-12609) . 

25 This invention provides a compound having the structure: 

A- (aYDINYYTSE(3A) n 

wherein each T represents a threonine, each S represents a 
serine, each .E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
30 each I represents an isoleucine; and each N represents an 
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asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
5 forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein |3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 

10 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein A 
represents a carboxyl group or an ami dated carboxyl group; 
wherein all of a, Y, D, I , N, Y, Y, T, S , E and (3 are joined 

15 together by peptide bonds, further- provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

20 

This invention also provides a compound having the 
structure : 

( GaYDINYYTSEp ) n - A 

wherein each T represents a threonine, each S represents a ( 
25 serine, each E represents a glutamic, acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
30 they are joined together by peptide bonds in consecutive 
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order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to 13 amino acids, with the 
5 proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: I beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 0 
10 represents an amino group or an acetylated amino group; 
wherein all of a, Y, D, I ,N, Y, Y, T, S , E and p are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n is an integer from 1 to 8, 
15 A is a polymer, and the solid line represents up to 8 
linkers which attach the structure in parentheses to A. 

This invention provides a compound having the structure: 
A - (aYDINYYTSEpX) n 

20 wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 

25 with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction ; 

30 wherein 3 represents from 0 to 333 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
5 therefrom in the carboxy terminal direction; wherein X 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a, Y , D, I , N, Y 7 Y, T, S , E and (3 are joined 
together by peptide bonds, further provided that at least 
.two tyrosines in the compound are sulfated, wherein n is an 
10 integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

i 

This invention also provides a compound having the 
15 structure: 

( SaYDINYYTSEp ) n " A 

wherein each T represents a threonine, each S represents a 
serine, each E represents. a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 

20 each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 

25 forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino, terminal direction; 
wherein (3 represents from 0 to 33 3 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 

30 have a sequence identical to the sequence set forth in SEQ 
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ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 0 
represents an amino group or an acetylated amino group; 
wherein all of a, Y , D, I ,N, Y, Y, T, S , E and 3 are joined 
5 together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein u is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

10 

The polymer of the above compounds includes but is not 
limited to the following: a linear lysine polymer; a 
branched lysine polymers; a linear arginine polymer; a 
branched arginine polymer; and polyethylene glycol (PEG) , a 
15 linear acetylated lysine polymer, a branched acetylated 
lysine polymer, a linear chloroacetylated lysine polymer 
and a branched chloroacetylated lysine polymer. 

The above compounds can be produced by various methods 
20 known to those skilled in the art, including but not 
• limited to the following. Methods for producing synthetic 
multimeric peptides such as multiple antigen peptides, 
synthetic polymeric constructs, and branched lysine 
oligopeptides are well known to those skilled in the art 
25 (Spetzler and Tarn, Int. J . Pept . Prot . Res. 45:78, 1995; 
Yai et al . , J. Virol., 69:320, 1995; Okuda et al . , J. Mol . 
Recognit. 6:101, 1993). For example, radially branched 
peptides can be produced by performing standard solid-phase 
peptide synthesis methods using branched lysine skeletons 
30 on 4- (oxy-methyl) -phenylactamidomethyl or other suitable 



solid resin. Peptide chains are elongated in parallel in a 
stepwise fashion using optimized t -butyl oxycarbonyl /benzyl 
chemistry as described (Sabatier et al . , Biochemistry 
32:2763, 1993). Peptides are liberated from the resin, 
•purified by reversed-phase chromatography over a C18 or 
other suitable column and characterized by analytical HPLC 
and mass spectroscopy. In another approach, monomeric 
peptides are synthesized, purified, and then covalently 
coupled to lysine copolymers using N- succinimidyl maleimido 
carboxylate chemistry. In another approach, the peptides 
can also be made in the form of affinity type multimers. 
For example, peptides may be synthesized with an affinity 
tag such as biotin. These affinity tagged peptides can then 
be mixed with affinity ligands capable of binding' 
multimerically , such as streptravidin . Other site-specific 
ligation chemistries are known to the skilled artisan. 

This invention provides a compound comprising the 
structure : 

eaYDnnynnnEpx 

wherein each E represents a glutamic acid, each D 
represents an aspartic acid, and each Y represents a 
tyrosine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
5 therefrom in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
group; wherein A represents a carboxyl group or an ami dated 
carboxyl group; 

wherein n represents any amino acid, 
10 wherein all of ot / Y / D / n / n,Y / n / n / n,E and (5 are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 

15 In one embodiment of this compound, the compound comprises 
amino acids in addition to those in the YDIHIYnnnE peptide, 
and such amino acids correspond to those present in the 
CCR5 receptor sequence set forth in SEQ ID NO : 1 , yet an 
amino acid may be replaced with a homologous amino acid. 

20 The sequence YDnnYnnnE corresponds to amino acid residues 
10-18 of the sequence set forth in SEQ ID NO:l. For 
example, if the peptide has one additional amino acid on 
its N terminal end, then the sequence could be I YDnnYnnnE 
or alternatively, the I could be replaced with G, A, V or 

25 L. 

In one embodiment of the above compound, the compound is a 
peptide which comprises consecutive amino acids having the 
sequence YDnnYnnnE. 

30 



In one embodiment of the above compound, the tyrosines at 
positions 1 and 5 of the sequence YDnnYnnnE are sulfated. 

As used herein, "homologous amino acids" are those which 
have chemically similar side chains. For example, aliphatic 
side chains (G, A, V, L and I) ; aromatic side chains (F, Y 
and W) ; basic aide chains (K, R and H) ; acidic side chains 
(D and E) ; amide side chains (N and Q) ; aliphatic hydroxyl- 
containing side chains (S and T) ; sulfur-containing side 
chains (C and M) . Homology between amino acids may also be 
drawn on other bases, such as size, polarity, hydrogen 
bonding potential, hydrophilicity and hydrophobicity . 
Proline differs from the above amino acids in that it 
contains a secondary rather than primary imino group. 
Accordingly, proline may be considered an imino group. 
Substitution or proline with another amino acid (e.g. G, A 
or S) can increase the flexibility of a peptide . 
Conversely, substitution of another amino acid with a 
proline can stabilize a desired conformation. 

This invention provides a compound comprising the 

structure : 

eaYDINYYTSE&X 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
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joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 
5 wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
se q Uence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
10 therefrom in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
group; wherein X represents a carboxy 1 gr<pup or an amidated 
carboxyl group ; 

wherein all of a, Y , D, I , N, Y, Y, T, S , E and p are joined 
15 together by peptide bonds; 

further px'ovided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, D at 
position 2, Y at position 5 and E at position 9 may be 
20 replaced with a homologous amino acid. 

- In one embodiment of the above compound, with respect to 
replacing homologous amino acids, any I amino acid residue 
may be replaced with a G,A,V or L amino acid residue. In 

25 one embodiment of the above compound, any N amino acid 
residue may be replaced with a Q amino acid residue. In one 
embodiment of the above compound, any Y amino acid residue 
may be replaced with a F or W amino acid residue. In one 
embodiment of the above compound, any T amino acid residue 

30 may be replaced with a S amino acid residue. In one 
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embodiment of the above compound, any S amino acid residue 
may be replaced with a T amino acid residue. In one 
embodiment of the above compound, any C may be replaced 
with M,S,T,A,G,N, or Q. 

5 

In one embodiment, a C amino acid residue within the .(3 
region of the compound may be replaced with any other amino 
acid. 

10 This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R298, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 
W427, thex-eby inhibiting binding of HIV-1 gpl20 to a CCR5 
chemokine receptor. 

15 

The above amino acid numbering is per HIV-1 strain HxB2. 
(Genbank Accession No. AAB50262). Amino acids D324, 1325 
and R326 are derived from HIV-1 strain JR-FL (Genbank 
Accession No. AAB05604) . 

20 



25 



30 
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The amino acid sequence (SEQ ID NO: 17) for HIV-1 HxB2 gpl20 
is set forth below: 

1 MRVKEKYQHL WRWGWRWGTM LLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA 
61 YDTEVHNVWA THACVPTDPN PQEWLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV 
5 121 KLTPLCVSLK CTDLKNDTNT NSSSGRMIME KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD 
181 IIPIDNDTTS YKLTSCNTSV ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT 
241 NVSTVQCTHG IRPWSTQLL LNGSLAEEEV V1RSVNFTDN AKTIIVQLNT SVEINCTRPN 
3 01 NNTRKRIRIQ RGPGRAFVTI GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQFGNNKTII 

3 61 FKQSSGGDPE IVTHSFNCGG EFFYCNTSTQL FNSTWFNSTW STEGSNNTEG SDTITLPCRI 
10 421 KQIINMWQKV GKAMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR 

4 81 SELYKYKWK IEPLGVAPTK AKRRWQREK R 



The amino acid sequence (SEQ ID NO: 16) for HIV-1 JR-FL 
15 gpl20 is set forth below: 

1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG VPVWKEATTT LFCASDAKAY 
61 DTEVHNVWAT HACVPTDPNP QEWLENVTE H FNM W KNTKFMV EQMQEDIISL WDQSLKPCVK 
121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT SIRDEVQKEY ALFYKLDWP 
181 IDNNNTSYRL ISCDTSVITQ ACPKISFEPI P I H Y CAP AG F AILKCNDKTF NGKGPCKNVS 
20 241 TVQCTHGIRP WSTQLLLNG SLAEEEWIR SDNFTNNAKT IIVQLKESVE INCTRPNNNT 
3 01 RKS IHIGPGR AFYTTGEIIG DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 

3 61 GGDPE I VMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT EGNTITLPCR IKQIINMWQE 
421 VGKAMYAPPI RGQIRCSSNI TGLLLTRDGG INENGTE1FR PGGGBMRDNW RSELYKYKW 

4 81 KIEPLGVAPT KAKRRWQRE KR 

25 

This invention provides the above agent, wherein the 
epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N301, T303, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427. 

30 

This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R298, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 
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W427, thereby inhibiting HIV-1 infection of a CD4+ CCR5+ 
cell. 

This invention provides the above agent, wherein the 
5 epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N3 01, T303, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427. 

In one embodiment of any of the above agents, the agent is 
10 a peptide. In one embodiment of any of the above agents, 
the peptide comprises consecutive amino; acids having the 
sequence YDINYYTSE . In one embodiment at least two 
tyrosines in the compound are sulfated. In one embodiment, 
the tyrosines at positions 1 and 5 of the sequence 
15 YDINYYTSE are sulfated. 

In one embodiment of any of the above agents, the agent is 
an antibody or portion of an antibody. In one embodiment of 
any of the above agents, the agent is a nonpeptidyl agent. 
20 In one embodiment of ' any of the above agents, the agent is 
a peptidyl agent. 

This invention provides a method of inhibiting HIV-1 
infection of a CD4+CCR5+ cell which comprises contacting 
25 the cell . with an amount of an agent of the subject 
invention effective to bind to HIV-1 gp!20, so as to 
thereby inhibit HIV-1 infection of the CD4+ CCR5+ cell. 



30 



This invention provides a compound having one of the 
following structures : 



A- (aYDINYYTSEPX) , ( 9aYDINYYTSE3 ) - A, or A- (aYDINYYTSE(3 ) ~A 

wherein each' T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein X represents a carboxyl group or an amidated 
carboxyl group; 

wherein 6 represents an amino group or an acetylated amino 
group ; 

wherein all of a, Y, D, I , N, Y , Y, T, S , E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein A is a molecule that self -oligomerizes , and the 
solid line represents a peptide linker or a peptide, 
disulfide, or other chemical bond. 
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As used herein "peptide linker" is a peptide comprising 
consecutive amino acids having a sequence which includes 
but is not limited to GAG, SGGRGG and QSTRGGASGGG or 
repeating units thereof. One skilled in the art would know 
5 other flexible peptide linkers. 

In one embodiment of the above compound, the peptide that 
self -ol igomerizes contains alpha-helical regions capable of 
forming coiled coils. 

10 

The a-helical coiled coil (48) is probably the most 
widespread subunit oligomer! zat ion motif found in proteins 
(48-52) . It is a type of protein structure consisting of 
two to five amphipathic a-helices that "coil" around each 
15 other in a left-handed supertwist (48-52) . The sequences of 
coiled are characterized by a heptad repeat of seven 
residues with a hydrophobic repeat of mostly apolar amino 
acids . 

20 In one embodiment of the above compound, the peptide that 
self -oligornerizes is a peptide having a sequence of at 
least a portion HIV-1 gp41 heptad repeat sequence 1. In one 
embodiment, the HIV-1 gp41 heptad repeat sequence 1 is 
RQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ (SEQ ID 

25 NO: 3) . 

In one embodiment of the above compound, the peptide that 
self -oligornerizes is a peptide having a sequence of at 
least a portion of an HIV-1 gp41 heptad repeat sequence 2. 
30 In one embodiment, the HIV-1 gp41 heptad repeat sequence 2 
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is WMEWDREINNYTSLIH'SLIEESQNQQEKNEQELLE (SEQ ID NO: 4) . 

In one embodiment of the above compound, the peptide that 
self -oligomerizes is a peptide having a sequence 
corresponding to at least a portion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment, 
the sequence of the leucine zipper region of transcription 
factor GCN4 is HMKQLEDKVEELLS KNYHLENEVARLKKLVGER (SEQ ID 
NO : 6 ) . 

In one embodiment of the above compound, j the peptide that 

self-oligomerizes is a peptide having a sequence 

i 

corresponding to at least a portion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment, the 
sequence is derived from the leucine . zipper region of 
transcription factor GCN4 . In one embodiment, the sequence 
forms trimeric coiled-coils . In one embodiment, the 
sequence is HMKQIEDKIEEILSKIYHIENEIARIKKLIGEV (SEQ ID 

NO: 7) . 



In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to at least a portion of a leucine zipper 
region of a human protein. The human protein includes but 
25 is not limited to transcription activator c-fos, 
transcription activator c-jun, enzyme quiescent cell 
proline dipeptidase, macrophage scavenger receptor, 
salivary mucin (MUC7) , or human quiescent cell proline 
dipeptidase (QPP) - In one embodiment, the human protein is 
QPP and the leucine zipper region has the sequence 



30 
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LLTVEQALADFAELLRALRRDL (SEQ ID NO: 5). In one embodiment, 
the transcription activator is c-fos and the leucine zipper 
region h ^s the sequence 

LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAR (SEQ ID NO: 8) . In 
one embodiment, the transcription activator is c-jun and 
the leucine zipper region has the ' sequence 
HMRRIARLEEKVKTLKAQNSELASTANMLREQVAQLKQKY (SEQ ID NO: 9) . 

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to that of at least a portion of an antibody. 
In one embodiment, the portion of the antibody comprises 
the heavy chain. In one embodiment, the portion of the 
antibody heavy chain comprises the heavy chain constant 
region. In one embodiment, the portion of the antibody 
heavy chain comprises the hinge and Fc domains. In one 
embodiment, the portion of the antibody ' heavy chain 
comprises the Fc domain. In one embodiment, the portion of 
the antibody comprises the light chain. In one embodiment, 
the portion comprises the light chain constant region. In 
one embodiment, the portion of the antibody comprises the 
heavy and light chains. 

This invention provides a compound having one of the 
following structures: 

A- (aYDINYYTSE3X) , (6aYDINYYTSE(3) — A, or A- (aYDINYYTSE|3 ) - A 
wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
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asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
5 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction ; 

wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
10 joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
15 carboxyl group ; 

wherein 0 represents an amino group or an acetylated amino 
group ; 

wherein all of a, Y , D, I , N, Y , Y, T, S , E and p are joined 
together by peptide bonds, 
20 further provided that at least two tyrosines in the 
• compound are sulfated, 

wherein A is toxin, and the solid line represents a peptide 
linker or a peptide, disulfide, or other chemical bond. 

25 In one embodiment of the above compound, the toxin is a 
radionuclide. In one embodiment, the radionuclide is an 
alpha-emitting isotope. The alpha-emitting isotope includes 
but is not limited to 225 Ac, 211 At, 212 Bi , or 213 Bi . In one 
embodiment, the radionuclide is a beta-emitting isotope. 

30 The beta-emitting isotope includes but is not limited to 
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radionuclide may be emitting Auger and low energy electron. 
The radionuclide includes but is not limited to 131 I / 125 l or 
77 Br . 

In one embodiment of the above compound, the toxin is a 
chemical toxin. The chemical toxin may be a peptidyl 
chemical toxin. The peptidyl chemical toxin includes but is 
not limited to ricin. The chemical toxin may be a 
nonpeptidyl chemical toxin. The nonpeptidyl chemical toxin 
includes but is not limited to calicheamycin . 

This invention provides a compound having one of the 
following structures : 

A- (QYDINYYTSEPX) , ( 6aYDINYYTSE(3 ) - A, or A- (aYDINYYTSEp ) -A, 

wherein each T represents a threonine, each S represents a 
.serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
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ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
carboxyl group ; 

5 wherein 9 represents an amino group or an acetylated amino 
group; 

wherein all of a, Y, D, I , N, Y, Y, T, S , E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
10 compound are sulfated, 

wherein A is molecule with anti-HIV activity, and the solid 
line represents a peptide linker or a peptide, disulfide, 
or other chemical bond. 

15 In one embodiment of the above compound, the molecule with 
anti-HIV activity is a CD4 - immunoglobulin fusion protein. 
In one embodiment, the CD4 -immunoglobulin fusion protein is 
CD4-IgG2, wherein the CD4-IgG2 comprises two heavy chains 
and two lights chains, wherein the heavy chains are encoded 

20 by an expression vector designated CD4 - IgG2HC-pRcCMV (ATCC 
Accession No. 75193) and the light chains are encoded by an 
expression vector designated CD4 -kLC-pRcCMV (ATCC Accession 
No. 75194) . 

25 In one embodiment of the above compound, the molecule with 
anti-HIV activity is a compound which retards gp41 from 
adopting a conformation capable of mediating fusion of HIV- 
1 to a CD4+ cell by binding noncovalently to an epitope on 
a gp41 fusion intermediate. In one embodiment, the compound 

30 comprises a peptide selected from the group consisting of 
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T-20 (SEQ ID NO: 10), DP107 (SEQ ID NO: 11), N34 (SEQ ID 
NO: 12), C28 (SEQ ID NO: 13), N34 (L6) C28 (SEQ ID NO: 14), 
and T1249 (SEQ ID NO:15). 

5 As used herein, "T-2 0' 1 and "DP178" are used interchangeably 
to denote a peptide having the following amino acid 
sequence : YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF (SEQ ID 
NO:10) and as described [53,54]. 

10 DP107 has the following amino acid sequence: 

NNLLRA I E AQQHLLQLT VWG I KQLQAR I LAVERYLKDQ (SEQ ID NO: 11) 

N34 has the following amino acid sequence: 

SGI VQQQNNLLRA I EAQQHLLQLTVWG I KQLQAR (SEQ ID NO: 12) . 

15 

C2 8 has the following amino acid sequence: 
WMEWDREINNYTSLIHSLIEESQNQQEK (SEQ ID NO: 13) 

N34(L6)C28 has the following amino acid sequence: 
20 SGIVQQQNNLLRAI EAQQHLLQLTVWG I KQLQARSGGRGGWMEWDREINNYTSLIHSLI 
■ EESQNQQEK (SEQ ID NO: 14) 

T1249 has the following amino acid sequence : 
WQEWEQKITALLEQAQIQQEKNEYELQKLDKWASLWEWF (SEQ ID NO: 15) 

25 

This invention provides the above compound wherein the 
molecule with anti-HIV activity is a CCR5 chemokine 
receptor targeting agent. 



30 In one embodiment, the CCR5 chemokine receptor targeting 




-69- 

agent is an antibody or portion of an antibody. In one 
embodiment, the antibody includes but is not limited to PA8 
(ATCC Accession No. HB-12605), PA10 (ATCC Accession 
No. 12607), PA11 (ATCC Accession No. HB-12608) , PA12 (ATCC 
5 Accession No. HB-12609) , and PA14 (ATCC Accession No. HB- 
12 610) . In one embodiment, the antibody is PA14 (ATCC 
Accession No. HB-12610) . 

The antibody may be a monoclonal antibody or polyclonal 
10 antibody. The monoclonal antibody may be a human, humanized 
or chimeric antibody. This invention provides humanized 
forms of the above antibodies. 

As used herein, "humanized" describes antibodies wherein 

15 some, most or all of the amino acids outside the CDR 
regions are replaced with corresponding amino acids derived 
from human immunoglobulin molecules. In one embodiment of 
the humanized forms of the antibodies, some, most or all of 
the amino acids outside the CDR regions have been replaced 

20 with amino acids from human immunoglobulin molecules but 
where some, most or all amino acids within one or more CDR 
regions are unchanged. Small additions, deletions, 
insertions, substitutions or modifications of amino acids 
are permissible as long as they would not abrogate the 

25 ability of the antibody to bind a given antigen. Suitable 
human immunoglobulin molecules would include IgGl, IgG2 , 
IgG3, IgG4 , IgA and IgM molecules. A "humanized" antibody 
would retain a similar antigenic specificity as the 
original antibody, i.e., in the present invention, the 

30 ability to bind CCR5 . 



One skilled in the art would know how to make the humanized 
antibodies of the subject invention. Various publications, 
several of which are hereby incorporated by reference into 
this application, also describe how to make humanized 
antibodies. For example, the methods described in United 
States Patent No. 4,816,567 (55) comprise the production of 
chimeric antibodies having a variable region of one 
antibody and a constant region of another antibody. 

United States Patent No . 5 , 225 , 539 (56) describes another 
approach for the production of a humanized antibody. This 
patent describes the use of recombinant DNA technology to 
produce a humanized antibody wherein the CDRs of a variable 
region of one immunoglobulin are replaced with the CDRs 
from an immunoglobulin with a different specificity such 
that the humanized antibody would recognize the desired 
target but would not be recognized in a significant way by 
the human subject's immune system. Specifically, site 
directed mutagenesis is used to graft the CDRs onto the 
framework . 

Other approaches for humanizing an antibody are described 
in United States Patent Nos . 5,585,089 (57) and 5,693,761 
(58) and WO 90/07861 which describe methods for producing 
humanized immunoglobulins. These have one or more CDRs and 
possible additional amino acids from a donor immunoglobulin 
and a framework region from an accepting human 
immunoglobulin. These patents describe a method to increase 
the affinity of an antibody for the desired antigen. Some 
amino acids in the framework are chosen to be the same as 
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the amino acids at those positions in the donor rather than 
in the acceptor. Specifically, these patents describe the 
preparation of a humanized antibody that binds to a 
receptor by combining the CDRs of a mouse monoclonal 
5 antibody with human immunoglobulin framework and constant 
regions . Human framework regions can be chosen to maximize 
homology with the mouse sequence. A computer model can be 
used to identify amino acids in the framework region which 
are likely to interact with the CDRs or the specific 
10 antigen and then mouse amino acids can be used at these 
positions to create the humanized antibody.. 

The above patents 5,585,089 and 5,693,761, and WO 90/07861 
(59) also propose four possible criteria which may used in 

15 designing the humanized antibodies. The first proposal was 
that for an acceptor, use a framework from a particular 
human immunoglobulin that is unusually homologous to the 
donor immunoglobulin to be humanized, or use a consensus 
framework from many human antibodies. The second proposal 

20 was that if an amino acid in the framework of the human 
immunoglobulin is unusual and the donor amino acid at that 
position is typical for human sequences, then the donor 
amino acid rather than the acceptor may be selected. The 
third proposal was that in the positions immediately 

25 adjacent to the 3 CDRs in the humanized immunoglobulin 
chain, the donor amino acid rather than the acceptor amino 
acid may be selected. The fourth proposal was to use the 
donor amino acid reside at the framework positions at which 
the amino acid is predicted to have a side chain atom 

30 within 3A of the CDRs in a three dimensional model of the 



antibody and is predicted to be capable of interacting with 
the CDRs. The above methods are merely illustrative of some 
of the methods that one skilled in the art could employ to 
make humanized antibodies. 

This invention provides the above compound, wherein the 
portion of the antibody is a Fab fragment of the antibody. 
This invention provides the above compound, wherein the 
portion of the antibody comprises the variable domain of 
the antibody. This invention provides the above compound, 
wherein the portion of the antibody comprises a 
complementarity determining region or CDR portion of the 
antibody. The monoclonal antibody includes but is not 
limited to an IgG, IgM, IgD, IgA, or IgE monoclonal 
antibody . 

This invention provides the above compound, wherein the 
molecule with anti-HIV activity is a chemokine or chemokine 
derivative. The chemokine includes but is not 'limited to 
RANTES, MlP-la, MIP-lp, SDF-1 or other chemokine which 
blocks HIV-1 infection. The chemokine derivative includes 
but is not limited to Met -RANTES , AOP-RANTES, RANTES 9-68, 
or NNY- RANTES . 

The molecule may also be a non- chemokine agent capable of 
binding to chemokine receptors and inhibiting fusion of 
HIV-1 to CD4 + cells. The non-chemokine agents include, but 
are not limited to, chemokine fragments and chemokine 
derivatives and analogues. in one embodiment, the agent 
does not include naturally occurring chemokines . The non- 
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chemokine agents include multimeric forms of the chemokine 
fragments and chemokine derivatives and analogues or fusion 
molecules which contain chemokine fragments, derivatives 
and analogues linked to other molecules. In one 

5 embodiment, the non-chemokine agents do not include 
bicyclams and their derivatives as described in U.S. Patent 
No. 5,021,409, issued June 4, 1991, the content of which is 
incorporated by reference into this application. Some 
bicyclam derivatives have been previously described with 
10 antiviral activities (60, 61). 

In an embodiment of this invention, the non-chemokine agent 
is an oligopeptide. In another embodiment, the non- 
chemokine agent is a polypeptide. In still another 

15 embodiment, the non-chemokine agent is an antibody or a 
portion thereof. Antibodies against the chemokine receptor 
may easily be generated by routine experiments. It is also 
within the level of ordinary skill to synthesize fragments 
of the antibody capable of binding to the chemokine 

20 receptor. In a further embodiment, the non-chemokine agent 
is a nonpeptidyl agent such as TAK-779 (64) or AMD3100 
(65) . 

Non-chemokine agents which are purely peptidyl in 
25 composition can be either chemically synthesized by solid- 
phase methods (62) or produced using recombinant technology 
in either prokaryotic or eukaryotic systems. The synthetic 
and recombinant methods are well known in the art. 

30 Non-chemokine agents which contain biotin or. other 
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nonpeptidyl groups can be prepared by chemical modification 
of synthetic or recombinant .chemokines or non-chemokine 
agents. One chemical modification method involves 

periodate oxidation of the 2 -amino alcohol present on 
chemokines or non-chemokine agents possessing serine or 
threonine as their N-terminal amino acid (63). The 
resulting aldehyde group can be used to link peptidyl or 
non-peptidyl groups to the oxidized chemokine or non- 
chemokine agent by reductive amination, hydrazine, or other 
chemistries well known to those skilled in the art. 

This invention provides a compound having one of the 
following structures : 

A- (aYDINYYTSE3X) , (0aYDINYYTSE3 ) ~A, or A- (aYDINYYTSEp) — A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a. represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
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therefrom in the carboxy terminal direction; 

wherein X represents a carboxyl group or .an amidated 
carboxyl group ; 

wherein 0 represents an amino group or an acetylated amino 
5 group; 

wherein all of a, Y , D , I / N / Y,Y,T / S # E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 
10 wherein A is peptidyl or nonpeptidyl agent, and the solid 
line represents a peptide linker, or a peptide, disulfide, 
or other chemical bond. 

In one embodiment, the A is a nonpeptidyl agent, and the 
15 nonpetidyl agent polyethylene glycol. 

This invention will be better understood from the 
Experimental Details that follow. However, one skilled in 
the art will readily appreciate that the specific methods 
20 and results discussed are merely illustrative of the 
invention as described more fully in the claims that follow 
thereafter . 
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EXPERIMENTAL DETAILS 

First Series of Experiments 
A . Materials 

5 Purified recombinant CD4-IgG2 protein was produced by 
Progenies Pharmaceuticals, Inc. from plasmids CD4-IgG2-HC- 
pRcCMV and CD4 -kLC-pRcCMV as described (Allaway et al . AIDS 
Res. Hum. Retroviruses 11:533, 1995). Soluble CD4 is 
commercially available (NEN Life Science Products, Boston, 
10 MA) . Anti-CCR5 MAb 2D7 was purchased from Pharmingen (San- 
Diego, CA) . 

The plasmids designated PPI4 - tPA-gpl2 0jr.fj.-V3 <"> and PPI4-tPA- 
gpl2 0 DH123 were prepared as described (Hasel et al, US 

15 Patents 5,869,624 and 5,886,163). Monomeric gpl20 

glycoproteins were produced in CHO cells stably transfected 
with the PPI4»tPA-gpl20 plasmids and purified to 
homogeneity as described (Hasel et al . US Patents 5,869,624 
and 5,886,163; Trkola et al , Nature 384:184, 1996). The 

20 antibodies designated PA8 , PA10, PA12 and PA14 were 
prepared by growing the corresponding hybridoma cell line 
in mouse ascites and isolating the antibody using protein A 
affinity chromatography as described (Olson et al . J.Virol. 
73:4145, 1999). LI . 2 - CCR5 + cell s were cultured as described 

25 (Olson et al . J.Virol. 73:4145, 1999). 

Peptides containing different segments of the CCR5 Nt were 
custom- synthesized by solid-phase f luorenylmethoxycarbonyl 
chemistry using phospho- and sulfo- tyrosine precursors as 
30 building blocks where indicated (Figure 6) . Biotinylated 
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versions of peptides S-10/14 and P-10/14 incorporated a C- 
terminal GAG spacer preceding a biotinylated lysine. 
Following cleavage from the resin, peptides were purified 
by reverse -phase chromatography on CIS columns (Vydac, 
5 Hesperia, CA) and analyzed by HPLC and mass spectroscopy. 
Figure 6 describes the different peptides that were used in 
this study. 

Binding of apl20 to CCR5 

10 A gpl20/CD4 complex formed from monomeric gpl20 (lOOnM) and 
biotinylated CD4-IgG2 (50nM) was added to lxlO 6 L1.2-CCR5+ 
cells in the presence of different concentrations of 
peptide (Olson et al . J.Virol. 73:4145, 1999). CD4-IgG2 is 
tetrameric and therefore binds four molecules of gpl2 0, 

15 which increases binding of the complex to CCR5 (Allaway et 
al . AIDS Res. Hum. Retroviruses 11:533, 1995). The mean 
fluorescence intensity (m.f.i.) was measured by flow 
cytometry after addition of phycoerythrin (PE) -labeled 
streptavidin (Becton Dickinson, San Jose, CA) . Inhibition of 

20 gpl20/CCR5 binding was calculated: (m.f.i. with 
peptide) / (m. f . i . without peptide) xl00%. 

It was first tested whether tyrosine- sulfated peptides 
spanning amino acids 2-18 of the CCR5 Nt could inhibit 

25 binding of the gpl2 0 JR _ FL /CD4 - IgG2 complex to CCR5 + cells. The 
HIV-1 JR . FL isolate exclusively uses CCR5 as a co-receptor 
(Dragic et al . Nature 381:667, 1996). Only peptides S- 
3/10/14 and S- 10/14 inhibited complex binding to the cells 
in a dose-dependent manner (Fig. la). Peptides S-10 and S- 

30 14 had no inhibitox~y activity, even at the highest 
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concentrations (Fig. la). Peptide TS-10/14, spanning amino 
acids 10-14, did not inhibit gpl2 0jk.pl/CD4 -IgG2 binding to 
CCR5 + cells, despite the presence of two sulfo- tyrosine 
residues (Fig. lb) . 

5 

Tyrosine -phosphorylated peptides P-10/14 and P-3/10/14 did 
not inhibit gpl2 0 JR . FL /CD4 - IgG2 binding to CCR5 + cells (Fig. 
lb) . As further specificity controls we synthesized 
peptides containing the first seventeen residues of the 
10 CCR5 Nt in random order with sulfo- tyrosines in positions 
10 and 14 (SS-10/14) or in positions 2 and 12 (SS-2/12) . 
Neither one of these peptides reduced t gpl2 0jr.pl/CD4 - IgG2 
binding to CCR5 + cells, even at the highest concentrations 
(Fig. lb) . 

15 

Surface plasmon resonance measurements (BlAcore) 
Streptavidin-coated sensor chips (BlAcore AB, Sweden) were 
conditioned with five injections of regeneration solution 
(1M NaCl, 50mM NaOH) and equilibrated with HBS-EP buffer 

20 (lOmM KEPES, 150mM NaCl , 3M EDTA, 0.005% polysorbate 20) as 
recommended by the manufacturer. Biotinylated peptides were 
then immobilised on the chip by injection of peptide 
(lOOnM) in HBS-EP buffer, followed by an injection of 
.regeneration solution and equilibration with HBS-EP buffer. 

25 4 00 resonance units (RU) of peptide were bound to the 
sensor chip surface. Solutions of the following proteins 
(lOOnM) were passed over the sensor chip surface: gpl20, 
sCD4, gpl2 0/sCD4 / PA8 , PA10 and 2D7 . Surface plasmon 
resonance was monitored and displayed in arbitrary 

30 resonance units (RU) as a function of time. Following 



injection of each solution the chip was regenerated and 
equilibrated as described above. 

Biotinylated peptide was attached to the streptavidin- 
coated gold surface of a sensor chip and solutions 
containing different gpl20/sCD4 complexes were flowed over 
the immobilized peptide. Adsorption of the complex due to 
complex/peptide binding was detected by an increase in 
surface - plasmon resonance signal (RU) , which reports 
changes in the effective refraction index very near the 
gold surface of the sensor chip (Schuck Ann. Rev. Biophys 
Biomol Struct 26:541, 1997). For proteins of similar size, 
such as the different gp!20/sCD4 complexes, RU plateau 
values are directly proportional to the amount of protein 
bound to the peptide. 

Specific association of the gpl2 0 JR _ FL /sCD4 complex with the 
sulf o- tyrosine-containing peptide bS- 10/14 was accompanied 
by a significant increase in RU (Fig. 2a). The signal 
plateau but not the shape of the sensograms varied with 
gpl2 0 JR _ FL /sCD4 concentration indicating that the 

peptide/complex interaction was dose -dependent (data not 
shown) . The sensorgram obtained with bP- 10/14 is similar 
to the one obtained in the absence of peptide, indicating a 
complete lack of association of the phcsphorylated peptide 
with the protein complex (Fig. 2a). Neither gpl20 JR . FL nor 
sCD4 alone produced a significant increase in RU, 
indicating that they did not associate with the immobilized 
peptides. (Fig. 2b,c). The gpl2 0-AV3jr.pl/sCD4 complex was 
also unable to associate with the peptides (Fig. 2d) . 
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To further ascertain the specificity of the peptide/complex 
association we performed BIAcore analyses using envelope 
glycoproteins from HIV-1 DH123/ an R5X4 isolate, and HIV-1^, 
an X4 isolate (5) . gpl2 0 DH123 /s'CD4 associated specifically 
5 with the sulfated peptide, although the plateau RU values 
were lower than those observed with gpl 2 0jr.pl/sCD4 (Fig. 
2e) . We did not detect any binding of gpl20 DH123 /sCD4 to the 
phosphorylated peptide (Fig.2e), nor did gpl20 DH123 alone 
associate with the peptides (Fig. 2f ) . Finally, gpl20 LAI with 
10 or without sCD4 was not able to associate with either one 
of the peptides (Fig. 2g,h). 

These methods could be readily modified to screen for 
agents that bind CCR5 or that block its interaction with 

15 antibodies, gpl20 or other ligands. For example, direct 
binding of the agents could be analyzed as described above, 
where the agent is substituted for the anti-CCR5 antibody 
or gpl2 0/sCD4 complex. In another embodiment, the agent 
could be mixed or pre-incubated with the anti-CCR5 antibody 

20 (or gpl20/sCD4 complex) prior to passing the mixture over 
■ biosensor chips as described above. 

Binding of MAbs to CCR5 

L1.2-CCR5 cells (IxlO 6 ) were incubated with anti-CCR5 MAb 
25 (50nM) ± peptide (100/iM) . MAb binding was detected using a 
PE-labeled goat anti-mouse antibody (Caltag Laboratories, 
Burlingam, CA) . The m.f.i value was measured by flow 
cytometry as described (Olson et al . J. Virol. 73:4145, 
1999) . MAb binding was calculated as above. 
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We determined whether the CCR5 Nt peptides could inhibit 
binding of a panel of anti-CCR5 MAbs to CCR5 + cells, pa 8 
binding was reduced significantly by all wild-type peptides 
containing amino acids 2-18, regardless of tyrosine 
modification (Fig. 3) . BIAcore analysis confirmed that PA 8 
similarly and specifically associated with both sulfated 
and phosphorylated peptides (Fig. 4) . Binding of PA12 to 
CCR5 was not inhibited by any of the peptides (Fig. 3). 
PA10 binding to CCR5 was inhibited only by S-3/10/14 (Fig. 

3) . PA10 was also observed to associate with bS-10/14 and 
to a lesser extent with bP-10/14 in BIAcore analysis (Fig. 

4) , which may be more sensitive than the gpl20/CCR5-binding 
assay. Binding of 2D7 to CCR5 was not inhibited by any of 
the peptides (Fig. 3) . No significant interaction was 
observed between any CCR5 Nt peptide and Mab 2D7 (Figs. 3 
and 4) , whose epitope resides within the second 
extracellular loop on CCR5 . 

Single cycle HIV-1 entry assay 

Nlluc + env* particles pseudotyped with envelope glycoproteins 
from MuLV, HTLV-1 and HIV-1 strains JR-FL, HxB 2/ DH123, Gun- 
1 were made as described (Dragic et al . J. Virol". 72:279, 
1998) . Target cells (Hela-CD4 + CCR5 + or U8 7 -CD4 4 CCR5 + ) were 
incubated with virus-containing supernatant fractions 
(lOOng/ml p24) + peptide (100/zM) for 4 h. then washed and 
resuspended in culture media. After 4 8 hours the cells were 
lysed and lucif erase activity (relative light units, 
r.l.u.) was measured using a standard kit (Promega, 
Madison, WI) as described (Dragic et al . J. Virol. 72:279, 
1998). Viral entry was calculated: (r.l.u. with 
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peptide) / (r . 1 . u without peptide) xl00%. 

The ability of different CCR5 Nt peptides to inhibit HIV-1 
entry into CD4 + CCR5 + CXCR4 + cells was tested using a 
5 lucif erase-based single round of entry assay (5) . Only 
peptides S-10/14 and S-3/10/14 inhibited the entry of the 
R5 isolate HIV-1 JR _ FL by approximately 50% in HeLa-CD4 + CCR5 + 
and U87MG-CD4 + CCR5 + (Fig. 5 and data not shown) . We were 
unable to inhibit the entry of the R5X4 isolates HIV-1 DH123 
10 and HIV-l Gun _ a/ or of the X4 isolate HIV-l,^. The entry of 
MuLV and HTLV pseudotypes was also unaffected by the 
peptides (Fig. 5). 

15 Screening assays 

1) HIV-1 gpl20/CD4-IgG2 

Streptavidin-coated 96-well microtiter plates (NEN Life 
Science Products, Boston, MA) are blocked with 200 /il/well 
of 5% bovine serum albumin (Sigma, St. Louis, MO) in PBS 

20 buffer and washed with assay buffer (0.5% Tween 20, 1 % 
fetal bovine serum, and 2% BSA in PBS buffer) . The plates 
are then incubated 1 hour at ambient temperature with 100 
/il/well of biotinylated CCR5 N-terminal sulfopeptide at a 
concentration of 500 fiM in assay buffer. Following a wash 

25 step, the plates are incubated for 1 hour at ambient 
temperature with an HIV-l JR . FIj gp!20/CD4 - IgG2 complex in the 
presence or absence of inhibitory agent. The plates are 
again washed and incubated for 3 0 minutes with a 
horseradish peroxidase- labeled goat antibody to human IgG 

30 (Kirkegaard & Perry Laboratories, Gaithersburg , MD) 
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followed by addition of the TMB (3,3' ,5,5'- 
tetramethylbenzidine) chromogenic substrate (Pierce) . The 
reaction is stopped by addition of 100 /il/well of 2N H 2 S0 4 
prior to colorimetric detection at a wavelength of 450 nm. 
Wells without biotinylated peptide serve as negative 
controls. The percent inhibition of binding is calculated 

aS [1 " (OD witn inhibitor ~ O^ccntrol well) / (OD without inhibitor ~ OD contTol 

wen)] x 100, where OD represents the average optical density 
observed for the indicated wells. 

2 ) Ant i - CCR5 ant ibodi es 

Streptavidin-coated microtiter plates are blocked and 
incubated with CCR5 N-terminal peptide as described above. 
Following a wash step, the plates are incubated for one 
hour at ambient temperature with the anti-CCR5 antibody 
PA10 in the presence or absence of inhibitory agent. The 
plates are again washed and incubated for 3 0 minutes with a 
horseradish peroxidase-labeled goat antibody to mouse IgG 
(Kirkegaard & Perry Laboratories, Gaithersburg, MD) 
followed by addition of TMB substrate for colorimetric 
detection as described above. The percent inhibition 
mediated by the inhibitory agent is calculated as described 
above . 

Discussion 

Tyrosine-modif ied peptides spanning the region of the CCR5 
Nt that contains residues important for viral entry were 
synthesized. (Dragic et al . J. Virol. 72:279, 1998; Rabut 
et al . J. Virol. 72:3464, 1998; Farzan et al . J. Virol. 
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72:1160, 1998; Dorantz et al . J. Virol. 71:6305, 1997). 
Interactions between the Nt peptides and gp!20/CD4 
complexes were characterized. Peptides containing sulfo- 
tyrosines in positions 10 and 14 efficiently inhibited 
5 binding of gpl2 0 JR _ FL /CD4 to CCR5 . Substitution of the 
sulfate groups for phosphates, which are also negatively 
charged at physiological pH, rendered the Nt peptides 
inactive. Inhibition of gpl20/CCR5 binding was dependent, 
therefore, on the presence of sulfate moieties and was not 

10 simply due to non-specific electrostatic interactions 
between the peptide and the gpl20/CD4; complex or the 
peptide and the cell surface. Inhibition of gpl20/CCR5 
binding was also dependent on the. primary structure 
surrounding the sulfo- tyrosines since peptides with random 

15 sequences of CCR5 amino acids 2-18 had no inhibitory 
activity. Additional Nt amino acids in the region 2-18 were 
important for activity since a shortened peptide containing 
just amino acids 10-14 was unable to inhibit gpl20/CD4 
binding, despite the presence of two sulf o-tyrosines . It 

20 would be straightforward to define the minimum number of 
amino acids needed for activity by systematically 
synthesizing sulf opept ides intermediate in length between 
peptide 2-18 and peptide 10-14. Similarly, sulf opeptides 
that incorporate a greater portion of the CCR5 Nt could be 

25 easily synthesized and tested for activity using the 
methods described herein. 

Qualitative BIAcore analyses allowed the demonstration of a 
highly specific, CD4 -dependent interaction between a 
30 tyrosine- sulf ated Nt peptide and gpl2 0 JR . FL . No binding of 
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the protein complex to a tyrosine-phosphorylated peptide 
was observed. Only gpl2 0s derived from isolates that use 
CCR5 as a co-receptor associated with the sulfated peptide. 
gpl2 0 DHI23 /CD4 binding was weaker than gpl2 0 JR _ FL /CD4 binding, 
suggesting that envelope glycoproteins from R5X4 isolates 
have a lower apparent affinity for CCR5 than envelope 
glycoproteins from R5 isolates. gp!20 LAI/ derived from an 
isolate that only uses CXCR4 , did not bind to the sulfated 
peptide. A V3 loop-deleted gpl20 JR . FL did not associate with 
the sulfated peptide, just as this protein was unable to 
bind to full length CCR5 on the cell surface (Trkola et al . 
Nature 384:184, 1996). 

The binding of the Nt peptides to several anti-CCR5 MAbs , 
all of which recognize conformational epitopes in CCR5 and 
inhibit gpl2 0/CCR5 binding were also studied. PA12 and 2D7 
did not bind to any of the peptides. Binding of PA8 to the 
peptides was independent of tyrosine-modif ication whereas 
PA10 associated more with the sulf o- tyrosine-containing 
peptide than with the phospho - tyros ine - containing peptide. 
It seems, therefore, that sulf o- tyrosines and phospho- 
tyrosines are relatively interchangeable for the. purpose of 
MAb binding but that gpl2 0/CD4 binding has an absolute 
requirement for sulf o- tyrosines . Relatively subtle 
differences in size and geometry of sulfate and phosphate 
groups might be relevant for binding of the CCR5 Nt with 
gpl2 0, which must not only accept the negative charge, but 
also coordinate, probably by hydrogen bonds, the tyrosine 
sulfate oxygens. The kinetics of MAb binding to the CCR5 Nt 
peptides exhibited large apparent on rates and slow 
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10 



apparent off rates, which also differed from our 
observations of gpl20/CD4 binding kinetics. 

None of the Nt peptides inhibited MuLV, HTLV and HIV-l^ 
envelope-mediated viral entry, which is not mediated by 
CCR5. In contrast, peptides S-10/14 and S-3/10/14 
specifically inhibited the entry of the HIV-Ijr.fl R5 strain 
in two different cell lines. The inhibition of HIV-1 entry 
by tyrosine-sulfated peptides was partial (-50%) but 
nonetheless striking given the difficulty of blocking this 
process with short, linear peptides (Jameson et al . Science 
240:1335, 1988; Chan and Kim Cell 93:681:1998; Doranz et 
al. J. Exp. Med. 186:1395, 1997; Heveker et al . Current 
Biology 8:369, 1998; Eckert et al . Cell 99:1, 1999). 
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Second Series of Exp eriments 

CD4 and CCR5 mediate fusion and entry of R5 HIV-1 strains. 
Sulf otyrosine and other negatively charged residues in the 
CCR5 amino- terminal domain (Nt) are crucial for gpl2 0 
5 binding and viral entry. It is shown' that a soluble 
gpl20/CD4 complex specifically binds to a peptide 
corresponding to CCR5 Nt residues 2-18, with sulf otyrosines 
in positions 10 and 14. This sulfopeptide also inhibits 
soluble gpl20/CD4 binding to cell surface CCR5 as well as 
10 infection by R5 virus. These observations prompted the 
further delineation of the determinants iof the gpl20-CCR5 
Nt sulfopeptide interaction. It is shown that residues 10- 
18 constitute the minimal domain of the CCR5 Nt that is 
able to specifically interact with soluble g!20/CD4 
15 complexes. In addition to sulf otyrosines in. positions 10 
and 14, negatively charged residues in positions 11 and 18 
participate in this interaction. Furthermore, the CCR5 Nt 
binds to a CD4 -induced surface on gpl20 that is composed of 
conserved residues in the V3 loop stem and the C4 domain. 
20 Binding of gpl20 to cell surface CCR5 , however, is further 
influenced by variable residues in the crown of the V3 
loop. This data suggest that gpl20 docking to CCR5 is an 
interdependent, multi-step process involving different 
regions of the envelope glycoprotein and the co-receptor. 
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Entry of HIV-1 R5 isolates into target cells is mediated by 
the successive interaction of the envelope glycoprotein 
gpl20 with CD4 and the CCR5 co-receptor [3] . Gpl20-CD4 
complex formation generates a large bonding energy that 
drives reordering of the gpl20 core structure [22, 31, 47] . 
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Changes in the orientation of the V1/V2 and V3 loops, as 
well as the bridging sheet (composed of the V1/V2 stem and 
C4) , cooperatively create and/or expose a co-receptor 
binding site on gpl20 [22,37,47]. The predicted co-receptor 
binding surface on gpl20 has a hydrophobic core surrounded 
by a positively charged periphery and is composed of both 
conserved and variable residues located in the C4 domain 
and V3 loop, with lesser contributions from the VI V2 stem 
[22, 36, 37] . 

It has been demonstrated that specific amino acids within 
the CCR5 amino- terminal domain (Nt , amino acids 2-31) , 
including negatively charged and tyrosine residues, are 
essential for CCR5-mediated fusion and entry of R5 and R5X4 
HIV-1 strains [5, 12, 13, 15, 35]. Farzan et al . [16] 
demonstrated that the CCR5 Nt undergoes both O- 
glycosylation and tyrosine sulfation. It is presently not 
known whether O-glycosylation plays a role in co-receptor 
function, but this possibility is suggested by observations 
that serines in the Nt are important for viral entry. 
Inhibition of cellular sulfation pathways, including 
tyrosine sulfation, greatly decreases gpl20 binding to CCR5 
as well as the entry of R5 and R5X4 HIV-1 strains into 
target cells ([16], E.G.C. unpublished data). Post- 
translational sulfation of the tyrosine residues in the 
CCR5 Nt, therefore, may critically modulate the 
susceptibility of target cells to HIV-1 infection in vivo. 

It was demonstrated that a CCR5 Nt -based peptide spanning 
30 residues 2-18 and containing sulf otyrosines in positions 10 
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and 14 specifically associates with soluble gp!20/CD4 
complexes containing envelope glycoproteins from R5 (JR-FL) 
and R5X4 (DH123) but not X4 (LAI) strains [11] . Peptides 
containing unmodified tyrosines or phosphotyrosines , 
5 however; did not bind soluble gpl20/CD4 complexes [11] . The 
tyrosine-sulfated CCR5 Nt therefore specifically interacts 
only with gpl20 proteins from isolates that use this co- 
receptor to gain entry into target cells. Furthermore, only 
the CCR5 Nt-based sulfopeptide inhibits binding of soluble 

10 gpl2 0jr.pl/CD4 to intact, cell surface-expressed CCR5 and 
moderately blocks the entry of the R5 isolate JR-FL. The 
affinity of soluble gpl20/CD4 for the CCR5 Nt sulfopeptide, 
however, is approximately 10-100-fold lower than for the 
native, membrane-associated co-receptor [11, 42, 46],. 

15 suggesting that other gpl20-CCR5 contacts are required to 
consolidate this interaction. This concept is further 
supported by studies of CCR5 chimera, as well as studies 
with inhibitors of CCR5 co-receptor function [12, 34, 38, 
32, 14] . 

20 

A novel ELISA is reported to detect binding of 
sulf opeptides to soluble gpl20/CD4 complexes, as well as 
anti-CCR5 MAbs and chemokines. ELISA and surface plasmon 
resonance (SPR) were used to further delineate the 

25 determinants of the gp!20-CCR5 Nt interaction. In order to 
define the minimal domain of the CCR5 Nt capable of 
specifically binding to soluble gpl20/CD4 complexes, 
sulf opeptides corresponding to different regions of the Nt 
were analyzed. To identify the gpl2 0 domains involved in 

30 sulfopeptide binding, inhibition of gpl2 0/CD4 complex 
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binding to CCR5 Nt sulf opeptides by anti-gpl20 Mabs was 
studied. Residues in or near the epitopes of inhibitory 
MAbs were mutated to alanine, and the gpl20 point mutants 
were compared for their ability to bind to CCR5 Nt 
sulfopeptides and cell-surface CCR5 . The data suggest that 
a mostly conserved surface of gpl20 binds to a nine-residue 
stretch of the CCR5 Nt , whereas more variable residues in 
the crown of the V3 loop may interact with a secondary 
binding site on CCR5 . 

Materials and Methods 



Reagents : CD4-IgG 2 , soluble CD4 (sCD4) , recombinant soluble 
gpl20s from HIV-1^ (X4), HIV-1 DH123 (R5X4), and HIV-l JR _ Flj (R5) 

15 isolates, anti-gp!20 MAb PA1 (directed against the V3 loop 
of JR-FL) and anti-CCR5 MAbs PA8 , PA10, PA11, PA12 , PA14 
were produced by Progenies Pharmaceuticals, Inc. 
(Tarrytown, NY) as described [1, 32]. MAbs 133-290, 133- 
192, 135-9, A32, 17b, 19b, 48d, 9284, G3-42, Cll, G45-60 

20 and 2G12 were a generous gift [26] . The small -molecule CCR5 
antagonist TAK-779 was obtained as described [14] . 



Peptides corresponding to different segments of the CCR5 Nt 
were synthesized as described previously (Table 1) [11] . 
25 Sulfo- or phospho-tyrosines were incorporated in positions 
10 and 14, and all peptides carried a carboxy- terminal Gly- 
Ala-Gly spacer preceding a biotinylated lysine. Residues 
were numbered according to their positions in the full 
length CCR5 protein. 



30 
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Surface plasmon resonance : Binding of gpl2 0/CD4 - IgG 2 complex 
and MAbs to CCR5 Nt -based peptides was measured as 
previously described [11] . Briefly, streptavidin-coated 
sensor chips were divided into two surfaces, each with a 
5 separate flow chamber. The sensor chip was conditioned and 
equilibrated as recommended by the manufacturer. 
Biotinylated peptide (400 resonance units, RU) was bound to 
the surface of the second chamber whereas the first chamber 
of the chip was used as a negative control. Gpl2 0/CD4 - IgG 2 

10 complex (50 nM) was passed over the chip surface in the 
presence or absence of MAbs (150 mM) Surface plasmon 
resonance was monitored and displayed in RU as a function 
of time using a Biacore X. After each measurement the chip 
was regenerated and equilibrated as recommended by the 

15 manufacturer. 

Generation of gp!20 alanine mutants and their binding to 
CD4 - IcG 2 : Mutant gpl20 proteins were generated using the 
QuickChange Kit from Stratagene (San-Diego, CA) . Gpl2 0 JR _ FIj/ 

20 cloned into the pPPI4 expression vector [4] , served as the 
• template for site directed mutagenesis. Nucleotide 
sequencing was performed to ascertain the presence of the 
appropriate mutation in the gpl20 coding sequence. 293T 
cells were calcium phosphate transfected with the different 

25 mutant gpl2 0 expression constructs. Supernatants containing 
soluble gp!2 0 proteins were harvested and cleared of debris 
by centrif ugat ion 24 hours post- transf ection . 

Quantification of gpl20 was performed by ELISA as 
previously described [40] ) . Briefly, 293T supernatants were 

30 boiled for 5 minutes and denatured gpl20 was captured on an 
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ELISA plate coated with D7324 (International Enzymes Inc. 
Fallbrook, CA) , a MAb that recognizes a 15 residue linear 
epitope in the carboxy- terminal end of gpl20. Captured 
gpl20 was detected by a mixture of anti-gpl20 MAbs B12 and 
B13 [40] , followed by incubation with a horseradish 
peroxidase - con j ugat ed (HRP) anti-mouse IgG antibody 
(Amersham Pharmacia, Piscataway, N.J.). Optical density 
(O.D.) was measured at 450 nm using the ImmunoPure TMB 
Substrate kit (Pierce, .Rockford, IL) . 



CD4-IgG 2 binding to non-denatured mutant gpl20 proteins also 
was measured. Plates coated with D7324 were used to capture 
native gpl2 0 from supernatant s of transiently transfected 
293T cells. CD4-IgG 2 (50 nM) was added to the plates and its 
15 binding was detected using an HRP- conj ugated goat anti- 
human IgG and TMB substrate as described above. 

CCR5 Nt peptide ELISAs : Streptavidin- coat ed ELISA plates 
(Pierce, Rockford, IL) were blocked with D-PBS/ 5% BSA for 

20 2 hours at room temperature then washed three times with 
assay buffer (D-PBS/ 0.5% Tween 20/ 1% Fetal Bovine Serum/ 
2% BSA) . Plates were then contacted with sulfo- or phospho- 
peptides (1 jug/ml) for 1 hour at room temperature and 
washed three times with assay buffer. Mixtures of CD4-IgG 2 

25 (50 nM) and purified gpl20 or gpl2 0 - containing supernatants 
in a 1:4 molar ratio were added to the plates for 1 hour at 
room temperature. Plates were washed three times and (HRP) - 
conjugated goat anti-human IgG was used to detect the 
presence of bound CD4-IgG 2 . The plates were developed using 

30 the TMB substrate as described above. Gpl2 0/CD4-IgG 2 binding 
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to the peptides was normalized for CD4-IgG 2 binding to the 
mutant gpl20 proteins. 

In a competition ELISA, peptides were captured onto the 
5 plates as described above. Inhibitor or assay buffer was 
added for 1 h prior to additi on of 9pl 2 0 JR _ FIj / CD4 - 1 gG 2 complex 
(1 nM) for an additional h at room temperature. The assay- 
was then completed as described above. Direct binding of 
anti-CCR5 murine MAbs to the peptides was examined as 
10 described above except that MAb was substituted for 
gpl20/CD4 - IgG 2 complex and a goat anti-mouse HRP-coupled 
antibody was used for detection. 

Binding of opl2 0 /CD4 - JaG ? complexes to cell-surface CCR5 : 
15 L1.2-CCR5 + cells (10 6 ) were incubated with gpl20-containing 
supernatant (lOOnM) and biotinylated CD4-IgG 2 (50nM) for 1 
hour at 37 °C in assay buffer, as previously described [32] . 
Gpl20/CD4 - IgG 2 bound to the cells was revealed by FACS 
analysis of the mean fluorescence of intensity (m.f.i.) 
20 after addition of streptavidin-PE (Pharmingen, San-Diego, 
CA.). Binding was calculated using the formula: (m.f.i. 
gp!20 mutants) / (m.f.i. gpl20 wild type) x 100% and 
normalized for CD4-IgG 2 binding to the mutant gpl2 0 
proteins . 

25 

Results 

An ELISA to detect binding of soluble apl20/CD4 complexes 
to CCR5 Nt -based peptides : Surface plasmon resonance (SPR) 
30 was previously used to show that gp!20/sCD4 complexes 
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specifically interact with a peptide spanning CCR5 residues 
2-18 and containing sulf otyrosines in positions 10 and 14 
(2-18, Table 1) . The on and off rates of complex-pept ide 
binding were extremely rapid and could not be measured 
5 precisely by SPR. The Kd was estimated to be in the 10~ 7 -10~ B 
range. Replacing monomeric sCD4 with tetravalent CD4-IgG 2 , 
however, lead to a dramatic shift in both on and off rates, 
lowering the Kd into the 10" 8 -10" 9 range (Figure 7a) . This 
observation prompted us to develop an EL1SA to directly 
10 detect complex-peptide binding. Streptavidin-coated ELI SA 
plates were used to capture biotinylated, CCR5 Nt-based 
peptides, and then further incubated with soluble 
gp!2 0/CD4-IgG 2 complexes. Complex binding was detected using 
an HRP- conjugated goat ant i- human IgG antibody. 

15 

Sulfopeptide 2-18 bound gpl2 0 JR _ FL /CD4 - IgG 2 with an IC S0 ~lnM, 
and gpl2 0 DH i23/CD4-IgG 2 with an IC 50 ~5nM (Figure 7b) . 
Sulfopeptide 2-18 did not measurably bind CD4-IgG 2 alone or 
in complex with either gpl2 0 LAI or V3 loop-deleted gpl2 0 JR . FL 

20 (Figure 7b and data not shown) . No binding was observed to 
an analogous CCR5 Nt phosphopeptide (2-18 (P), Figure 12) by 
any of the gpl20/CD4-IgG 2 complexes (Figure 7b) . Identical 
patterns of reactivity were observed for gpl20s in complex 
with CD4-y2, a divalent CD4 - immunoglobul in fusion protein 

25 [data not shown, (1)] However, no binding was observed for 
gp!20 in complex with anti-gpl20 MAbs 2G12 and IgGlbl2, 
even though the latter' s epitope overlaps the CD4 binding 
site on gpl20 (data not shown) . Thus the ELISA reproduces 
the critical biological features of cell-surface CCR5/gpl20 

30 interactions, including a dependence upon CCR5 tyrosine 
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sulfation, CD4 , the V3 loop, and the coreceptor usage- 
patterns of the parent viruses. 

Using a competition ELISA, inhibition of gpl2 0jr.pl/CD4 - IgG2 
5 binding to the sulfopeptide 2-18 was enabled with the anti- 
CCR5 MAb PA8 . However, binding of soluble gpl2 0/CD4-IgG 2 
complexes to the sulfopeptide was not inhibited by TAK-779, 
nor the CC-chemokines MlP-la, MIP-lp and RANTES even when 
used at supraphysiologic concentrations. 

10 

Binding of CCR5 Nt peptides to anti-CCR5 MAbs and soluble 
gp!20/CD4 : ELISA was used to test the binding of a panel of 
anti-CCR5 MAbs to peptides 2-18 and 2-18 (P). We had 
previously demonstrated that MAbs PA8 , PA11 and PA12 bind 

15 epitopes in the Nt , PA10 binds an epitope that spans the Nt 
and ECL2 and PA14 binds an epitope exclusively in ECL2 
[32] . Here we show that PA8 avidly binds peptides 2-18 and 
2-18 (P) (Figure 2). PA10 binds avidly to 2-18 and 
moderately to 2-18 (P). PA11 binds moderately to 2-18 and 

20 weakly to 2-18(P). PA12's binding is weak for 2-18 and 
undetectable for 2-18 (P). Finally, PA14 does not recognize 
either the sulfopeptide or the phosphopeptide (Figure 8) . 
Furthermore, PA8 binds similarly to all of the CCR5 Nt- 
based sulf opeptides in Figure 12 (data not shown) . 

25 

In order to more precisely delineate the minimal CCR5 Nt 
domain that specifically binds to soluble gpl20/CD4 
complexes, a panel of sulf opeptides spanning different 
stretches of the CCR5 Nt were synthesized (Figure 12) . All 
30 of the peptides carried sulf otyrosines in positions 10 and 
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14 since we previously showed that these are required for 
complex-peptide binding. Binding of gpl2 0jr.pl/CD4 - IgG 2 to the 
different sulf opeptides was tested by ELISA. Although the 
strongest binding was observed using longest sulf opept ide, 
'2-18, significant binding for peptide 10-18, which 
demonstrated -3 -fold lower avidity was also observed 
(Figure 9) . Peptides 8-15, 6-16 and 10-15 bound the soluble 
complex at least ten-fold less avidly than 2-18. (It was 
previously shown that a sulfopeptide consisting of residues 
10-14 did not bind soluble gp!20/CD4 complexes.) 
Furthermore, the gpl2 0 JR . FL /CD4 - IgG2 complex only weakly 
bound to peptide 10-18 carrying two alanine mutations in 
positions 11 and 18. Previous mutagenesis studies have 
shown that residues Asp-11 and Glu-18 are important for 
fusion, entry and gpl20-CCR5 binding [12, 13]. Finally, it 
should be noted that the same binding patterns to the 
different sulf opept ides were observed with soluble 
complexes containing gpl2 0 DH123 (data not shown) . 

Inhibition of gpl20/CD4 binding to CCR5 Nt sulf opeptidps by 
anti-gpl20 MAbs : In order to determine which domains of 
gpl20 were involved in binding to CCR5 Nt -based 
sulf opeptides , the ability of a panel of well -characterized 
anti-gpl20 MAbs [25] to inhibit binding of either the 
gpl20 JR . FIj /CD4-IgG2 or the gpl20 DK123 /CD4 - IgG2 complex to the 
2-18 sulfopeptide was tested. Only MAbs directed against 
CD4 -induced (CD4i) epitopes and the V3 loop were capable of 
inhibiting binding of the gpl2 0/CD4 - IgG2 complex to the 
CCR5 sulfopeptide. Inhibition of gpl2 0 aR . FL and gpl2 0 DH123 
binding by MAbs 17b and 48d was >90%. The anti-V3 loop MAb 
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19b [28] , which recognizes an epitope in the crown of the 

V3 loop (sequence -I G--FY-T) and is reactive with R5 

strains, inhibited gpl20jR.pL binding >90% and gpl20 DH123 
binding by approximately 80%. Anti-V3 loop MAb PA1 , which 
5 was raised against gpl20jR.PL (W.C.O., unpublished results) 
efficiently inhibited binding of gpl2 0 JR . FL but not gpl20 DH123 . 
Finally, anti-V3 loop MAb 9284 [21] , which recognizes an 
epitope spanning residue 307 to 330 in the V3 loop of X4 
strains, was unable to inhibit binding of either gpl20 

10 protein to the sulf opeptide . MAbs directed against other 
epitopes in other constant and variable .regions of gpl20 
also had no effect on binding of the soluble complex to the 
peptides. Similar results were obtained when the anti-gpl20 
MAbs were used to inhibit soluble complex binding to cell 

15 surface CCR5 (data not shown) . 

p-inrHncr of mutant soluble a r)1 20/C D4 com plexes to CCR5 Nt 
an! fope ptides ; Numerous studies have shown that residues in 
the V3 loop determine co- receptor usage and binding [6-10, 

20 18, 20-21, 23-24, 27, 29, 33, 41-44, 46]. The crystal 
structure of a gpl2 0 lacking the V1/V2 and V3 loops in 
complex with sCD4 and the 17b MAb further implicated a 
conserved, CD4i surface on gpl20, adjacent to the V3 loop, 
in co-receptor binding [36, 37] . Single alanine mutants of 

25 all of the residues near or within regions previously shown 
to be important for co-receptor usage were generated. These 
gpl2 0 mutants were tested for their ability to bind to the 
CCR5-based sulfopeptide 2-18 as well as to cell surface 
CCR5 . Binding was normalized for gp!20 mutant binding to 

30 CD4-IgG 2 . Wild-type levels of binding were observed for all 
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mutants except W427A, R440A and R469A, which bound CD4-IgG 2 
with 5-10-fold lower but nonetheless measurable avidity. 



Residues in both strands of the V3 loop stem were found to 
5 be involved in gp!20 binding to the 2-18 sul f opept ide : 
Alanine mutants of residues R298, N301, T303, 1322, D324, 
1325 and R326 were found to decrease complex binding to the 
peptide by >10-fold. Residues in the crown of the V3 loop, 
including the GPGR motif, .however, had no effect on gpl2 0 

10 binding to the sulf opeptide . C4 residues in or adjacent to 
the two C-terminal (3-strands of the bridging sheet were 
also shown to participate in binding to the sulf opept ide : 
..Alanine substitutions of R419, 1420, K421, Q422 and R444 
decreased complex binding to the sulf opeptide by 5-10-fold. 

15 None of the alanine substitutions that we introduced in the 
other regions of gpl20 significantly affected complex- 
peptide-interactions . 

It was furthermore demonstrated that additional gpl2 0 
20 residues are involved in complex binding to cell surface 
■ expressed CCR5 . Alanine substitutions of residues S306, 
G310, P311, R313, F315, Y316 in the crown of the V3 loop 
decreased complex binding to CCR5 by 5-10-fold. 
Furthermore, alanine substitutions of several residues in 
25 CI, C2 and C3 also had a moderate effect on complex binding 
to CCR5 . Finally, it is noted that alanine substitutions of 
R440 and R469 increased complex binding to both 2-18 and 
CCR5, whereas substitutions of E320 and W427 increased 
complex binding to CCR5 only. 



30 
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Discussion 

Tyrosine- sulfated CCR5 Nt peptides were studied for binding 
to soluble gpl20/CD4 complexes as well as anti-CCR5 MAbs , 
CC-chemokines and TAK-779 using a novel solid-phase ELISA. 
5 ' Inhibition of peptide -complex interactions by anti-gpl20 
MAbs was explored by surface plasmon resonance. These Mabs 
were also tested for their ability to inhibit complex 
binding to cell surface CCR5 . In addition, a panel of 
gpl20 point mutants were generated and then their 

10 reactivity was compared with CCR5 Nt peptides and cell 
surface CCR5 . The principal conclusions are that (1) 
residues 10 to 18 of the CCR5 Nt may define the minimum 
recognition site for gp!20, (2) gpl20 binding to the CCR5 
Nt depends on highly conserved residues located in the C4 

15 domain and the stem of the V3 loop, and (3) gpl20 binding 
to cell surface CCR5 depends on a broader region that 
includes residues in the crown of the V3 loop, CI, C2 and 
C3 . The findings suggest that distinct domains of gpl20 
and CCR5 bind in a multi-step fashion and raise questions 

20 about the determinants of specificity of the co-receptor- 
gpl2 0 interaction . 

An ELISA was developed to detect complex-peptide binding 
based on the observation that the tetravaient gp!20/CD4 -IgG 2 

25 complex binds to CCR5 Nt sulf opeptides ten-to a hundred - 
fold more avidly than the monovalent gpl20/sCD4 complex. 
Complex- sulf opeptide binding was only observed with gpl20 
proteins derived from R5 and R5X4 , but not X4 HIV-1 
strains. V3 loop deleted gpl20 JR _ FL failed to bind to the 

30 sul f opeptides . Phosphopeptides did not bind to any of the 




v> \j uj/mi / j \* - - — • 

-103- 

soluble complexes. Thus, the ELISA reproduces the critical 
biological features of cell-surface CCR5-gpl20 

interactions, including a dependence upon CD4 , CCR5 
tyrosine sulfation, the V3 loop and the co-receptor usage 
5 patterns of the parental viruses. 

CCR5 Nt phosphopeptides and sulf opeptides were 
differentially recognized by anti-CCR5 MAbs in ELISA. PA 8 
possessed equal avidity for sulfated and phophorylat ed 

10 peptides, implying that its epitope does not include 
tyrosine side chains. PA10 and PA11 preferentially 

recognized the sulf cpept ide , albeit with varying 
efficiencies, suggesting that sulf otyrosines participate 
either directly in peptide-MAb interactions or indirectly 

15 by influencing epitope conformations. PA12 only interacted 
with the sulfopeptide and PA14 did not bind either Nt 
peptide. Ir was previously shown that both of these MAbs 
recognize discontinuous epitopes comprising residues in the 
Nt and ECL2 of CGR5 . The observations now imply that ECL2 

20 residues are marginal for PA12 binding and essential for 
PA14 binding to CCR5 . Finally, binding of soluble 

gpl20/CD4 complexes to CCR5 Nt peptides couls be completed 
with an anti-CCR5 Mab but not with either CC-Chemokines or 
TAK-779, whose binding sites have been mapped to other 

25 regions of CCR5 (14, 39). Both CC-chemokines and TAK-779, 
however, are able to compete with gpl2 0/CD4 binding to cell 
surface CCR5, perhaps through steric or comf ormational 
mechanisms. (14, 42, 46). It is noted that Farzan et al . 
reported that a CCR5 Nt sulfopeptide spanning residues 1-22 

30 partially blocks MlP-la binding to cell-surface CCR5 and we 
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attribute the discrepancy to differences in peptides and 
assays (17) . 

In order to more precisely delineate the gpl20 binding site 
5 in the CCR5 Nt , an ELISA was used to test binding of 
soluble complexes to several CCR5 Nt sulf opeptides . 10-18 
was the smallest sulfopeptide that avidly bound soluble 
gpl20/CD4 complexes and may define the minimum docking site 
for gpl20 on CCR5 . In addition to the two sulf otyrosines 

10 in positions 10 and 14, negatively charged amino acids Dll 
and E18 were found to be critical for complex-pept ide 
binding. It was concluded that a cluster of negative 
charges in the CCR5 Nt appears to represent the principal 
recognition motif for gpl20, although residues 2 to 9 

15 further contribute to binding. Similar patterns of peptide 
reactivity were observed for recombinant gpl2 0s derived 
from HIV-1jr.pl (R5) and HIV-1 DH123 (R5X4) , suggesting that the 
CCR5 Nt sulf opeptides recognize conserved structures in the 
envelope glycoprotein. Gpl2 0 DH123/ however, bound about 

20 five-fold less that gpl20 JR . FL to the sulf opeptides , which 
probably accounts for its less efficient usage of CCR5 
(13) . 

Anti-gpl20 MAbs were tested for their ability to inhibit 
25 gpl20/CD4 binding to sulf opeptides or to cell surface CCR5 . 
A number of anti-gpl2 0 MAbs directed against conserved and 
variable regions of the envelope glycoprotein were not 
inhibitory. Only Mabs 48d and 17b, directed against CD4i 
epitopes, and 19b and PA1 , directed against the V3 loop, 
30 efficiently inhibited gpl20 binding to the 2-18 



sulfopeptide and to cell surface CCR5 . The CD4i epitope 
was previously shown to participate in co-receptor binding 
and residues in the V3 loop primarily determine co-receptor 
specificity (36, 37) . The results now suggest that these 
regions of gpl20 determine its association with the CCR5 
Nt . It is noted that inhibition of peptide -complex binding 
by 19b, which recognizes an epitope in the V3 crown, is 
inconsistant with the finding by gpl20 mutagenesis 
experiments that residues in the V3 loop crown do not 
participate in complex-peptide binding. This leads to a 
conclusion that the inhibitory effect of 19b may be steric 
hindrance . 

In order to determine more precisely the. regions of gp!20 
that modulate the gpl20-CCR5 itneraction, the binding of a 
panel of gpl20 point mutants to the CCR5 Nt sulfopeptide 
and to cell surface CCR5 was tested. The mutants were 
created by the introduction of single alanine substitutions 
near or within regions previously shown to be important for 
the integrity of the CD4i epitope and/or CCR5 binding (36, 
37) . Highly conserved residues in C4 and the V3 loop stem, 
including for arginines and a lysine, were found to affect 
binding of gp!2 0 to the CCR5 Nt sulfopeptide (Figure 13) . 
These residues are located in two random coil segments of 
C4 that straddle the V3 loop stem and may constitute a 
positively charged CCR5 Nt binding domain (22) . 
Additional, conserved residues in the crown of the V3 loop, 
CI, C2 and C3 contribute to gpl2 0 binding to cell surface 
CCR5 (Figure 13) . It remains to be determined whether 
these residues interact with other extracellular domains of 
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CCR5 or whether they influence the conformation of C4 and 
the V3 loop stem in a way that is only relevant in the 
context of gpl20/CD4 binding to cell surface CCR5 . It is 
unlikely that these residues also interact with the Nt in 
5 the context of cell surface CCR5 because they are 
relatively distal from the C4 and V3 residues that were 
implicated in sulfopeptide binding (22) . 

To date, several lines- of evidence suggest that gpl20 binds 

10 to more that one region of the CCR5 co-receptor: (1) the 
affinity of gpl20s/CD4 for the CCR5 Nt sulfopeptide is 
approximately 10-100-fold lower that for the native, 
membrane-associated co-receptor (11, 42, 46), (2) co- 
receptor chimera studies implicate the extracellular loops 

15 in viral fusion and entry (2, 12, 34, 38) and (3) 
inhibitors of CCR5 co-receptor function such as Mabs 2D7 
and PA14, as well as TAK-779 do not bind to the CCR5 Nt yet 
block gpl20/CD4 binding to CCR5 (14, 32). The present 
findings could be interpreted to support a distributed 

20 model for gpl2 0-CCR5 'interactions that mirrors the two-site 
paradigm proposed for the interaction of certain chemokines 
with their receptors (30, 45) . In this model, binding is 
initially driven by electrostatic interactions between 
negatively charged residues in the receptor Nt and basic 

25 surfaces on the chemokine ligand. This binding serves to 
orient the ligand and promote its interactions with other 
portions of the chemokine receptor. The V3 loop crown may 
form initial electrostatic interactions with the 
extracellular loops of CCR5, which would allow the CCR5 Nt 

30 to bind to a conserved region of gpl2 0 comprising residues 
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in C4 and the V3 loop stem. Alternatively, the CCR5 Nt 
could first bind the C4/V3 stem domain, which would them 
promote an interaction of the V3 loop with some, other 
region of CCR5 . All of these interactions involve 

additional gpl20 residues that we have yet to identify. 
The role of the putative second interaction is unclear but 
it may further stabilize the gpl20-CCR5 interaction, 
optimally orienting the fusion apparatus, or triggering 
gp41 conformational changes that are required for fusion. 

The findings present us with a seeming paradox wherein nine 
residues of the CCR5 Nt confer specificity on the CCR5- 
gpl20 interaction by binding to gpl20 residues that are 
highly conserved among clade B isolates, regardless of 
their co-receptor usage. However, although the C4 and V3 
stem residues themselves are conserved, their precise 
placement may differ for R5 and X4 viruses. Clearly, 
relatively minor differences in the orientation, exposure 
or relative positioning of these widely separated residues 
could abrogate binding to a short peptide but not a MAb 
(e.g., 17b) possessing a larger, more distributed binding 
site (37). In addition, more variable amino acids (e.g., 
324) within or outside the C4/V3 loop stem may contribute 
to the specif icity - of the gpl20-Nt interaction, and we 
showed that residues N279, R313, P369 and R444 participate 
in gpl2 0/CD4 binding to cell surface CCR5 but not to CCR5 
Nt sulf opeptides . Future studies employing additional 
gpl20 mutants together with CCR5 mutants and CXCR4-based 
sulf opeptides will shed light on the specificity 
determinants of the gp!20-co-receptor interaction. 



WO 01/64710 r^i/uovj/vwujr* 

-108- 

CD4 and CCR5 mediate fusion and entey of R5 HIV-1 strains. 
Sulfotyrosines and negatively charged residues in the CCR5 
Nt are crucial for binding of gpl2 0 and viral antry. 
5 Soluble gpl20/CD4 complexes specifically bind to CCR5 Nt 
peptides containing sulf otyrsinces in positions 10 and 14. 
CCR5 Nt sulfotyrosines inhibit gpl20/CD4 binding to CCR5 as 
well as viral entry. Residues in the V3 loop and the C4 
region og gp!2 0 compose a binding site for the CCR5 amino 

10 terminal domain. Redisues 10-18 of the CCR5 Nt constitute a 
minimal binding domain for gpl20: sulfotyrosines Y10 and 
Y14 and negatively charged residues Dll and E18 are 
important for binding. The CCR5 Nt terninal binding site on 
gpl20 is composed mostly of residues in the V3 loop stem 

15 and the C4 domain. 
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What is claimed: 

1. A compound comprising the structure: 
6aYDINYYTSE3A 

wherein each T represents a threonine, each S 
5 represents a serine, each E represents a glutamic 

acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
10 proviso that if there are j more than 2 amino acids, 

they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
15 direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
20 forth in SEQ ID NO: 1 beginning with the P at position 

19 and extending therefrom in the carboxy terminal 
direction; 

wherein 6 represents an amino group or an acetylated 
amino group; wherein X represents a carboxyl group or 
25 an ami dated carboxyl group; 

wherein all of a, Y, D, I , N, Y, Y, T, S , E and p are joined 
together by peptide bonds ; 

further provided that at least two tyrosines in the 
compound are sulfated. 

30 
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2. The compound of claim 1, wherein 3 represents less 
than 17 amino acids. 

3. The compound of claim 1, wherein 3 represents less 
5 than 16 amino acids. 

4. The compound of claim 1, wherein 3 represents less 
than 15 amino acids. 

10 5. The compound of claim 1, wherein 3 represents less 
than 14 amino acids. 

6. The compound of claim 1, wherein 3 represents less 
than 13 amino acids. 

15 

7. The compound of claim 1, wherein (3 represents less 
than 12 amino acids. 

8. The compound of claim 1, wherein 3 represents less 
20 than 11 amino acids. 

9. The compound of claim 1, wherein 3 represents less 
than 10 amino acids. 

25 10. The compound of claim 1, wherein 3 represents less 
than 9 amino acids . 



11. The compound of claim 1, wherein 3 represents less 
than 8 amino acids . 

30 
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12. The compound of claim 1, wherein (3 represents less 
than 7 amino acids. 

13. The compound of claim 1, wherein (3 represents less 
5 than 6 amino acids. 

14. The compound of claim 1, wherein (3 represents less 
than 5 amino acids. 

10 15. The compound of claim 1, wherein 3 represents less 
than 4 amino acids. 

16. The compound of claim 1, wherein (3 represents less 
than 3 amino acids. 

15 

17. The compound of claim 1, wherein 3 represents less 
than 2 amino acids. 

18. The compound of claim 1, wherein 3 represents less 
20 than 1 amino acid. 

19. The compound of claim 1, wherein a represents less 
than 9 amino acids. 

25 20. The compound of claim 1, wherein a represents less 
than 8 amino acids. 

21. The compound of claim 1, wherein a represents less 
than 7 amino acids. 
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22. The compound of claim 1, wherein a represents less 
than 6 amino acids. 

23. The compound of claim 1, wherein a represents less 
5 than 5 amino acids. 

24. The compound of claim 1, wherein a represents less 
than 4 amino acids. 

10 25. The compound of claim 1, wherein a represents less 
than 3 amino acids. 

26. The compound of claim 1, wherein a represents less 
than 2 amino acids. 

15 

27. The compound of claim 1, wherein a represents less 
than 1 amino acid. 

28. A composition comprising the compound of claim 1 and a 
20 detectable marker attached thereto. 

29. The composition of claim 28, wherein the detectable 
marker is biotin. 

25 30. The composition of claim 28, wherein the detectable 
marker is attached at the C- terminus of the compound. 

31. A composition which comprises a carrier and an amount 
of the compound of claim 1 effective to " inhibit 
30 binding of HIV-1 to a CCR5 receptor on the surface of 



a CD4+ cell. 



A method of inhibiting human immunodeficiency virus 
infection of a CD4+ cell which also carries a CCR5 
receptor on its surface which comprises contacting the 
CD4 + cell with an amount of the compound of claim 1 
effective to inhibit binding of human immunodeficiency 
virus to the CCR5 receptor so as to thereby inhibit 
human immunodeficiency virus infection of the CD4 + 
cell . 

The method of claim 32, wherein the CD4+ cell is 
present in a subject and the contacting is effected by 
administering the compound to the subject. 

A method of preventing CD4+ cells of a subject from 
becoming infected with human immunodeficiency virus 
which comprises administering to the subject an amount 
of the compound of :;laim 1 effective to inhibit 
binding of human i rnunodef iciency virus to CCR5 
receptors on the surface of the CD4+ cells so as to 
thereby prevent the subject's CD4+ cells from becoming 
infected with human immunodeficiency virus. 

A method of treating a subject whose CD4+ cells are 
infected with human immunodeficiency virus which 
comprises administering to the subject an amount of 
the compound of claim 1 effective to inhibit binding 
of human immunodeficiency virus to CCR5 receptors on 
the surface of the subject's CD4 + cells so as to 
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thereby treat the subject. 

36. The method of any one of claims 33-35, wherein the 
compound is administered by aerosol, intravenous, oral 
5 or topical route. 



37. The method of claim 33 or 35, wherein the subject is 
infected with HIV-1 prior to administering the 
compound to the subject. 

10 

38. The method of claim 33 or 34, wherein the subject .is 
not infected with HIV-1 prior to administering the 
compound to the subject. 

15 39. The method of claim 38, wherein the subject is not 
infected with, but has been . exposed to, human 
immunodeficiency virus. 

40. The method of any one of claims 33-35, wherein the 
20 effective amount of the compound comprises from about 

1.0 ng/kg to about 100 mg/kg body weight of the 
subject. 

41. The method of claim 40, wherein the effective amount 
25 of the compound comprises from about 100 ng/kg to 

about 50 mg/kg body weight of the subject. 
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The method of claim 41, wherein the effective amount 
of the compound comprises from about 1 /ig/kg to about 
10 mg/kg body weight of the subject. 



The method of claim 42, wherein the effective amount 
of the compound comprises from about 10 0 pig/ kg to 
about 1 mg/kg body weight of the subject. 

The method of any one of claims 33-35, wherein the 
subject is a human being. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) immobilizing the compound of claim 1 on a solid 
support ; 

(b) contacting the immobilized compound from step (a) 
with sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on 
the immobilized compound under conditions 
permitting binding of the CCR5 ligand to the 
immobilized compound so as to form a complex ,- 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the 
agent; and 

(e) detecting whether any CCR5 ligand is displaced 
from the complex, wherein displacement of 
detectable CCR5 ligand from the complex indicates 
that the agent binds to the compound so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
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comprises : 

(a) contacting the compound of claim 1 with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 
ligand to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is 
bound to the compound in the complex; 

(d) contacting the complex from step (a) with the 
agent so as to displace CCR5 ligand from the 
complex; 

(e) measuring the amount of CCR5 ligand which is 
bound to the compound in the presence of the 
agent ; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in 
step (c) , wherein a reduced amount measured in 
step (e) indicates that the agent binds to the 
compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to 
the CCR5 receptor. 

A . method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) immobilizing the compound of claim 1 on on a 
solid support; 

(b) contacting the immobilized compound from step (a) 
with the agent and detectable CCR5. ligand under 



conditions permitting binding of the CCR5 ligand 
to the immobilized compound so as. to form a 
complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand 
which is bound to the immobilized compound in the 
complex; 

(e) " measuring the amount of detectable CCR5 ligand 

which binds to the immobilized compound in the 
absence of the agent ; 

(f ) comparing the amount of CCR5 i ligand which is 
bound to the immobilized compound in step (e) 
with the amount measured in step (d) , wherein a 
reduced amount measured in step (d) indicates 
that the agent binds to the compound or CCR5 
ligand so as to thereby identify the agent as one 
which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

The method of claim 47, wherein the amount of the 
detectable ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) contacting the compound of claim 1 with the agent 
and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the 



compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand 
which is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand 
which binds to the compound in the absence of the 
agent ; 

(e) comparing the amount of CCR5 ligand which is 
bound to the compound in step (c) with the amount 
measured in step (d) , wherein a reduced amount 
measured in step (c) indicates that the agent 
binds to the compound or CCR5 ligand so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

The method of claim 49, wherein the amount of the 
detectable ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 



The method of any one of 
detectable CCR5 ligand is 
marker . 



claims 45-50, wherein the 
labeled with a detectable 



A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises: 

a) immobilizing the compound of claim 1 on a solid 
support ; 

b) contacting the immobilized compound from step a) 
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with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 

contacting the immobilized compound from step a) 
with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 

comparing the binding signal measured in step b) 
with the binding signal measured in step c) , 
wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

15 53. The method of claim 52, wherein the solid support is a 
surface plasmon resonance sensor chip. 

54. The method of claim 52 or 53, wherein the binding 
signal is measured by surface plasmon resonance. 

20 

•55- A method of obtaining a composition which comprises: 

(a) identifying a compound which inhibits binding of 
a CCR5 ligand to a CCR5 receptor according to the 
method of any one of claims 45-50 and 52; and 
25 (b) admixing the compound so identified or a homolog 

or derivative thereof with a carrier. 



c) 



d) 



10 
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The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a complex comprising an HIV-1 
envelope glycoprotein and a CD4 -based protein. 
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The method of claim 56, wherein the HIV-1 envelope 
glycoprotein is gpl20, gpl40 or gpl60 . 

The method of claim 56, wherein the CD4 -based protein 
is soluble CD4 or CD4-IgG2. 

The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a chemokine . 

The method of claim 59, wherein the chemokine .is 
RANTES, MlP-la or MIP-lp. 

The method of any one of claims 45-5 0 and 52, wherein 
the CCR5 ligand is an antibody. 

The method of claim 61, wherein the antibody is 
selected from the group consisting of PA 8 (ATCC 
Accession No. HB-12605) , PA10 (ATCC Accession 
No. 12607), PA11 (ATCC Accession No.. HB-12608), PA12 
(ATCC Accession No. HB-12 6 0 9) . 

The method of claim 45 or 47, wherein the solid 
support is a microtiter plate well, a bead or surface 
plasmon resonance sensor chip. 

A compound having the structure : 

A- <aYDINYYTSE(3A) „ 
wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
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acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction ; 

wherein A represents a carboxyl group or an amidated 
c arboxy 1 group ; 

wherein all of of,Y,D, I,N,Y,Y,T, S,E and p are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8, A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

A compound having the structure : 
(6aYDINYYTSEP) n -A 

wherein each T represents a threonine, each S 
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represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together ' by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; , 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein 0 represents an amino group or an acetylated 
amino group; 

wherein all of a, Y , D, I , N, Y , Y, T , S , E and |3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8 , A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

The compound of claim 64 or 65, wherein the polymer is 
selected from the group consisting of a linear lysine 



polymer, a branched lysine polymer, a linear arginine 
polymer, a branched arginine polymer, polyethylene 
glycol, a linear acetylated lysine polymer, a branched 
acetylated lysine polymer, a linear chloroacetylated 
lysine polymer and a branched chloroacetylated lysine 
polymer . 

The compound of claim 1, wherein the compound is a 
peptide which comprises consecutive amino acids having 
the sequence YDINYYTSE . 

The compound of claim 67, wherein the tyrosines at 
positions 1 and 5 of the sequence YDINYYTSE are 
sulfated. 

A compound comprising the structure: 

eaYDnnYnnnEpx 

wherein each E represents a glutamic acid, and each Y 
represents a tyrosine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein £ represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 



forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein 0 represents an amino group or an acetylated 
amino group; wherein X represents a carboxyl group or 
an amidated carboxyl group ; 
wherein n represents any amino acid, 

wherein all of ex, Y, D, n, n, Y, n, n, n, E and |3 are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 

The compound of claim 69, wherein the compound is a 
peptide which comprises consecutive amino acids have 
the sequence YDnnYIIIIIIE . 

The compound of claim 70, wherein the tyrosines at 
positions 1 and 5 of the sequence YDnilYnnnE are 
sulfated . 



A compound comprising the structure: 
SaYDINYYTSEpX 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each 1 represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 



and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; I 

wherein 6 represents an amino group, or an acetylated 
amino group; wherein X represents a carboxyl group or 
an amidated carboxyl group; 

wherein all of a, Y, D, I , N, Y, Y, T, S , E and .(3 are joined 
together by peptide bonds ; 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, 
D at position 2, Y at position 5 and E at position 9 
may be replaced with a homologous amino acid. 

The compound of claim 72, wherein any I amino acid 
residue is be replaced with a G,A,V or L amino acid 
residue . 

The compound of claim 72, wherein any N amino acid 
residue is replaced with a Q amino acid residue. 



The compound of claim 72, wherein any Y amino acid 




-133- 

residue is replaced with an 

76. The compound of claim 72, 
residue is replaced with an 

5 

77. The compound of claim 72, 
with a T amino acid residue 

78 . The compound of claim 72, 
10 with an M, S, T, A, G, N, o: 




F or W amino acid residue. 

wherein any T amino acid 
S amino acid residue. 

wherein any S is replaced 

wherein any C is replaced 
• Q amino acid residue. 
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SEQUENCE LISTING 

<110> Progenies Pharmaceuticals, Inc., et al. 

<120> SULFATED CCR5 PEPTIDES FOR HTV-1 INFECTION 

<130> 2048/61 01 0-A-PCT/JPW/SHS/AX 

<140> NOT YET KNOWN 

<141> 2001-02-28 

<160> 17 

<170> Patentln version 3.0 j 
<210> 1 i 
<211> 352 , 
<212> PRT 
<213> human 
<400> 1 

Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 
15 10 15 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

Thr Asp lie Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Tin- Val Pro Phe Tip Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 



Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He lie Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

Gly Val Val Tin- Ser Val He Thr Tip Val Val Ala Val Phe Ala Ser 
145 150 155 160 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Thr Cys Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Tip Lys Asn 
180 185 190 

Phe Gin Thr Leu Lys He Val He Leu Gly Leu Val Leu Pro Leu Leu 
195. 200 205 

Val Met Val He Cys Tyr Ser Gly He Leu Lys Thr Leu Leu Arg Cys 
210 215 220 

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu lie Phe Thr lie 
225 230 235 240 

Met He Val Tyr Phe Leu Phe Tip Ala Pro Tyr Asn He Val Leu Leu 
245 250 255 

Leu Asn Tin- Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 265 270 

Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 
275 280 285 

His Cys Cys He Asn Pro He He Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 

Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 
305 310 315 320 



Cys Lys Cys Cys Ser He Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 
325 330 335 



Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 



<210> 2 
<211> 1376 
<212> DNA 
<213> human 

<400> 2 

gaattccccc aacagagcca agctctccat ctagtggaca gggaagctag cagcaaacct 60 

tcccttcact acaaaacttc attgcttggc caaaaagaga gttaattcaa tgtagacatc 1 20 

i 

tatgtaggca attaaaaacc tattgatgta taaaacagtt tgcattcatg gagggcaact 1 SO 
aaatacattc taggacttta taaaagatca ctttttattt atgcacaggg tggaacaaga 240 
tggattatca agtgtcaagt ccaatctatg acatcaatta ttatacatcg gagccctgcc 300 
aaaaaatcaa tgtgaagcaa atcgcagccc gcctcctgcc tccgctctac tcactggtgt 360 
tcatctttgg ttttgtgggc aacatgctgg tcatcctcat cctgataaac tgcaaaaggc 420 
tgaagagcat gactgacatc tacctgctca acctggccat ctctgacctg tttttccttc 480 
ttactgtccc cttctgggct eactatgctg ccgcccagtg ggactttgga aatacaatgt 540 
gtcaactctt gacagggctc tattttatag gcttcttctc tggaatcttc ttcatcatcc 600 
tcctgacaat cgataggtac ctggctgtcg tccatgctgt gtttgcttta aaagccagga 660 
cggtcacctt tggggtggtg acaagtgtga tcacttgggt ggtggctgtg tttgcgtctc 720 
tcccaggaat catctttacc agatctcaaa aagaaggtct tcattacacc tgcagctctc 780 
attttccata cagtcagtat caattctgga agaatttcca gacattaaag atagtcatct 840 
tggggctggt cctgccgctg cttgtcatgg tcatctgcta ctcgggaatc ctaaaaactc 900 



tgcttcggtg tcgaaatgag aagaagaggc acagggctgt gaggcttatc ttcaccatca 960 
tgattgttta ttttctcttc tgggctccct acaacattgt ccttctcctg aacaccttcc 1 020 
aggaattctt tggcctgaat aattgcagta gctctaacag gttggaccaa gctatgcagg 1080 
tgacagagac tcttgggatg acgcactgct gcatcaaccc catcatctat gcctttgtcg 1 140 
gggagaagtt cagaaactac ctcttagtct tcttccaaaa gcacattgcc aaacgcttct 1200 
gcaaatgctg ttctattttc cagcaagagg ctcccgagcg agcaagctca gtttacaccc 1260 
gatccactgg ggagcaggaa atatctgtgg gcttgtgaca cggactcaag tgggctggtg 1320 
acccagtcag agttgtgcac atggcttagt tttcatacac agcctgggct gggggt 1376 

<210> 3 
<211> 49 
<212> PRT 

<213> human immmaodeficiency virus 
<400> 3 

Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
1 5 10 15 

Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Tip Gly He 
20 25 30 

Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp 
35 40 45 

Gin 



<210> 4 
<211> 35 



<212> PRT 

<213> human immunodeficiency virus 
<400> 4 

Tip Met Glu Tip Asp Arg Glu He Asn Asn Tyr Tlir Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu 
35 

j 

<210> 5 1 
<211> 22 ! 
<212> PRT 
<213> human 

<400> 5 

Leu Leu Thr Val Glu Gin Ala Leu Ala Asp Phe Ala Glu Leu Leu Arg 
1 5 10 15 

Ala Leu Arg Arg Asp Leu 
20 

<210> 6 
<211> 33 
<212> PRT 



<213> human 



<400> 6 

His Met Lys Gin Leu Glu Asp Lys Val G]u Glu Leu Leu Ser Lys Asn 
1 5 10 15 

Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu 
20 25 30 

Arg 

<210> 7 
<211> 33 
<212> PRT 
<213> human 

<400> 7 

His Met Lys Gin He Glu Asp Lys lie Glu Glu lie Leu Ser Lys lie 
1 5 10 15 

Tyr His He Glu Asn Glu He Ala Arg He Lys Lys Leu lie Gly Glu 
20 25 30 

Val 

<210> 8 
<211> 40 
<212> PRT 
<213> human 



<400> 8 

Leu Thr Asp Thx Leu Gin Ala Glu Thr Asp Gin Leu Glu Asp Glu Lys 
15 10 15 

Ser Ala Lexi Gin Thr Glu He Ala Asn Leu Leu Lys Glu Lys Glu Lys 
20 25 30 

Leu Glu Phe He Leu Ala Ala Arg 
35 40 

<210> 9 

<211> 40 

<212> PRT 1 

i 
i 

<213> human 



<400> 9 

His Met Arg Arg He Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys 
15 10 15 

Ala Gin Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu Arg Glu Gin 
20 25 30 

Val Ala Gin Leu Lys Gin Lys Tyr 
35 40 



<210> 10 
<211> 36 
<212> PRT 
<213> unknown 



8 



<220> 

<221> PEPTIDE 
<222> (l)-(36) 
<223> T-20 

<400> 10 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Tip Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 

<210> 11 
<211> 38 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(3S) 
<223> DP 107 



<400> 11 

Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Tin- Val Trp Gly He Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 

<210> 12 
<211> 34 

<212> PRT ! 

<213> unknown ! 

i 

<220> 

<221> PEPTIDE 
<222> (1)..(34) 
<223> N34 

<400> 12 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Tip Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg 




10 

<210> 13 
<211> 28 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(28) 
<223> C28 

<400> 13 

Tip Met Glu Tip Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys 
20 25 

<210> 14 

<211> 68 

<212> PRT 

<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(68) 



11 

<223> N34(L6)C28 
<400> 14 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Tip Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Ser Gly Gly Arg Gly Gly Tip Met Glu Tip Asp Arg Glu He 
35 40 45 

Asn Asn Tyr Tin- Ser Leu He His Ser Leu lie Glu Glu Ser Gin Asn 

50 . 55 60 | 

Gin Gin Glu Lys i 

65 ! 

j 

<210> 15 
<211> 39 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 

<222> (1)..(39) 

<223> T1249 

<400> 15 

Tip Gin Glu Tip Glu Gin Lys He Thr Ala Leu Leu Glu Gin Ala Gin 
15 10 15 



12 



He Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Tip 
20 25 30 

Ala Ser Leu Trp Glu Trp Phe 
35 

<210> 16 
<211> 502 
<212> PRT 

<213> human immunodeficiency virus 



<400> 16 

| 

Met Arg Val Lys Gly He Arg Lys Ser Tyr Gin Tyr Leu Trp Lys Gly 
15 10 15 , 

Gly Tin- Leu Leu Leu Gly He Leu Met He Cys Ser Ala Val Glu Lys 
20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
35 40 45 

Thr Tin- Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Tip Lys 
85 90 95 

Asn Asn Met Val Glu Gin Met Gin Glu Asp He lie Ser Leu Trp Asp 
100 105 110 

GLn Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 
130 135 140 



13 

Tin- Met Glu Arg Gly Glu He Lys Asn Cys Ser Phe Asn He Thr Thr 
145 150 155 160 

Ser He Arg Asp Glu Val Gin Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 
165 170 175 

Asp Val Val Pro He Asp Asn Asn Asn Thr Ser Tyr Arg Leu He Ser 
180 185 190 

Cys Asp Tin- Ser Val He Thr Gin Ala Cys Pro Lys lie Ser Phe Glu 
195 200 205 

Pro lie Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys 
210 215 220 

Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser 
225 230 235 240 

Thr Val Glrr Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin Leu 
245 250 255 

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He Arg Ser Asp 
260 265 270 

Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin Leu Lys Glu Ser 
275 280 285 

Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He 
290 295 300 

His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu He He Gly 
305 310 315 320 

Asp He Arg Gin Ala His Cys Asn lie Ser Arg Ala Lys Trp Asn Asp 
325 330 335 

Thr Leu Lys Gin He Val He Lys Leu Arg Glu Gin Phe Glu Asn Lys 
340 345 350 

Tin- He Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu He Val Met 
355 360 365 

His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin 
370 375 380 




14 

Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr 
385 390 395 400 

Glu Gly Asn Thr He Thr Leu Pro Cys Arg He Lys Gin He He Asn 
405 410 415 

Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro He Arg Gly 
420 425 430 

Gin lie Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
435 440 445 

Gly Gly He Asn Glu Asn Gly Thr Glu He Phe Arg Pro Gly Gly Gly 
450 455 460 

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
465 470 475 480 

Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 
485 490 495 ; 

Val Gin Arg Glu Lys Arg 
500 



<210> 17 
<211> 511 
<212> PRT 

<213> human immunodeficiency virus 



<400> 17 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 

Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Tip Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 



15 



35 40 45 

Thr Tin- Tin- Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 

Val His Asn Val Tip Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 SO 

Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Tip 
85 90 95 

Lys Asn Asp Met Val Glu Gin Met His Glu Asp He He Ser Leu Tip 
100 105 110 

Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
115 120 125 

Leu Lys Cys Tin- Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
130 135 140 

Gly Arg Met He Met Glu Lys Gly Glu He Lys Asn Cys Ser Phe Asn 
145 " 150 155 160 

He Ser Tin- Ser He Arg Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe 
165 170 175 

Tyr Lys Leu Asp He He Pro He Asp Asn Asp Thr Thr Ser Tyr Lys 
180 185 190 

Leu Tlir Ser Cys Asn Thr Ser Val He Thr Gin Ala Cys Pro Lys Val 
195 200 205 

Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
210 215 220 

He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
225 230 235 240 

Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
245 250 255 

Tin- Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 
260 265 270 



16 



Ai-g Ser Val Asn Phe Thr Asp Asn Ala Lys Thr He lie Val Gin Leu 
275 280 285 

Asn Tin- Ser Val Glu He Asn Cys Tin- Arg Pro Asn Asn Asn Thr Arg 
290 295 300 

Lys Arg He Arg He Gin Aj-g Gly Pro Gly Arg Ala Phe Val Thr He 
305 " 310 315 320 

Gly Lys He Gly Asn Met Arg Gin Ala His Cys Asn He Ser Arg Ala 
325 330 335 

Lys Tip Asn Asn Thr Leu Lys Gin He Ala Ser Lys Leu Arg Glu Gin 
340 345 350 

Phe Gly Asn Asn Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp 
355 360 365 

Pro Glu He Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
370 375 380 

Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Tip 
385 390 395 400 

Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu 
405 410 415 

Pro Cys Arg lie Lys Gin He lie Asn Met Tip Gin Lys Val Gly Lys 
420 425 430 

Ala Met Tyr Ala Pro Pro lie Ser Gly Gin He Arg Cys Ser Ser Asn 
435 440 445 

lie Tlir Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu 
450 455 460 

Ser Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Tip Arg 
465 470 475 480 

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val 
485 490 495 

Ala Pro Tin- Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg 
500 505 510 



